ABSTRACT


This paper discusses some problems in the machine translation of natural
language, in particular, for translation from German into English. An
implementation of some parts of the translating process has been built. The
system consists of a German interpretive grammar, to take in German text and
output a set of semantic representations, and a generator, to produce English
sentences from single semantic representations. Although based on the
assumption that understanding is necessary for correct translation of text,
the system does not now contain an understanding component to choose between
semantic representations. The representation of knowledge and its use in
natural language understanding is a research area that is already under
intensive investigation elsewhere. The implementation described here is based
on a systemic grammar analysis of German and English, and it applies and
extends the work of Winograd. Special attention is paid to questions of
semantic representation in a multi-language setting and to stylistic issues in
English generation.


This report is a revised version of a thesis submitted to the Department of
Electrical Engineering on January 23, 1974 in partial fulfillment of the
requirements for the Degrees of Master of Science and Electrical Engineer.

Chapter 1 -- Introduction


1.1 The Problem

The following pages describe a model of the translating process, in
particular, a system designed to accept German text and produce an English
translation. The model is not in any sense a complete one, especially in the
crucial area of language understanding. An implementation of some parts of
the model, however, has been written. In view of the considerable history of
the mechanical translation problem, I should stress that the objective of this
project was not to construct a large-scale working system but rather to see
how far some existing programs and techniques could go in handling a group of
problems that come up in text.

Does it even make sense to speak of mechanical translation as an
independent problem? In the early 60's it was widely recognized that
mechanical translation required the full resources of language understanding.
If the translating process is strictly a matter of understanding in one
language and generating in another, does the mechanical translation problem as
such merit attention? I think it is fair to answer "yes" to this question.
Translation seems to require the full power of language interpretation but not
the full power of generation. The hardest problem of generation, deciding
what to say and organizing the message, is generally not at issue in
translation. So translation offers a somewhat circumscribed context within
which to discuss issues of understanding, language representation, and
production.

The translation problem also has the attraction of a readable output,
namely the translation. The output is a criterion by which success may be
measured, although of course there are a number of pitfalls here: first, it is
not hard to recognize, but rather difficult to describe, what a good
translation is. Second, a limited system like the one discussed here can be
groomed to accept particular sentences gracefully, so that its performance
cannot truly be judged either on the text it can process successfully or on
the no doubt unlimited amount of text that will cause it to sputter and die.
Once we know the scope of a particular system, however, we can use a given
output, or a lack of one, as a basis for comparisons and evaluations. Through
the MT problem, then, it is possible to consider some problems common to a
number of areas of natural language processing.


1.2 The History of the Problem

If the translating system discussed here is not in any sense a "solution"
to the mechanical translation problem, it does represent the evolution that
has occurred since Warren Weaver's 1949 memorandum on the subject. For its 25
year history, mechanical translation has not been involved with a single
problem, but rather there have been a series of problems, as each system that
was built pointed up other areas that merited attention. Earliest attempts at
MT were essentially mechanized dictionaries, doing substitutions on a
word-by-word basis. Any more involved processing was left to human pre- and
post-editors. When it became clear that word-by-word substitutions did not
produce acceptable translations, the problem was refined to include syntactic
recognition. Attention in the 50's and early 60's was centered on parsing
with, for example, Oettinger's predictive analysis, or top-down, approach and
the work of Yngve's group at MIT. The more sophisticated the approach to
syntax became, however, the more sharply the nagging problem of syntactic
ambiguity came into focus. Mechanical translation began to mean semantics as
well as syntax, but it was not exactly clear what semantics meant. Three
approaches represented limited answers: probabilities of lexical co-occurrence
could be used to select the "likeliest" parse, a group of semantic categories
such as abstract-concrete could be used to restrict participants in a
grammatical relation, and a fixed set of keywords could be used to choose
possible interpretations and, hence, possible parses. But, as Bar-Hillel and
others remarked, even if such methods could claim upwards of 90% reliability,
they would have the disquieting property that it would be impossible to
predict where the errors would occur. A systematic approach to semantics was
necessary.
In recognition of the difficulty of the mechanical translation problem,
support for short-term, i.e. practical, projects dried up. The early 60's
saw a shift away from attempts to build working systems to an emphasis on
more basic research. Limited deduction came into use in systems such as that
of Raphael (29), with the implication that language use included not only
static "meaning" but also a deductive ability as well. Then, in 1970, with the
development of PLANNER and Winograd's question-answering system, the "limited"
was deleted, and more general deductive ability was advocated for natural
language processing. Implicit in Winograd's system, but not clearly evident
because of its question-answering nature, was also the assumption that a
sentence is not really an independent semantic unit but instead part of a
larger context. Eugene Charniak's detailed analysis (2) of the problems of
dealing with context implies that sentence-by-sentence approaches must go the
way of word-by-word translating systems. This is not to say that the sentence
is not an important basic unit, but it does imply that the only system that
has a chance of success at high level language processing is one that can deal
with the interrelationships within text.


While we have not yet come full circle, mechanical translation is
surfacing again as the name of the problem, or, more properly, as the name of
one problem, since language research has diverged considerably since the
inception of research on MT. Avowed translation work is underway at the
University of Texas at Austin, the University of California at Berkeley (3),
at Montreal, and at Stanford (Wilks, 37). At least one company, Logos
Development, is marketing an MT system for English to Russian, among other
languages. The system relies on postediting and [illegible] of the source
language, but the company claims a high rate of [illegible] before
postediting. I could not get any detailed information about semantic
processing done in the Logos III system, but from examining the company's
literature, I get the impression that some sort of semantic type-checking is
used.


Assuming that the test of a translating system is not its ability to
handle isolated sentences, but rather its ability to deal with connected text,
I selected a paragraph from a paper by Hempelmann on octopuses (17). The text
was used as a goal for the system, and I have tried to examine, in as general
a way as possible, the types of difficulties that the paragraph presents.
Choosing texts seemed much more desirable than writing them, since consciously
manufactured examples often have an unnatural sound, while the most
innocent-looking sample of "found" text will usually contain a number of
subtle difficulties. The full paragraph is presented and discussed below, but
let me first display the accomplishments of the translating system.

    Ein deutlich sichtbares Zeichen für die im Nervensystem verlaufenden
    Erregungen ist das Spiel der Chromatophoren der Cephalopoden.

    A clearly visible indication of the excitations that run through the
    nervous system is the play of the chromatophores of the cephalopod.

The skeptical will say that this is not a very impressive output, but let me
remark that a major factor limiting the output of the system is the very
constrained scope of the English dictionary. Although important problems
still do remain, it would take only relatively [...] routine work [...]
understanding, so that disambiguation of word senses and [...] instances of
coreference was done by the user. This bypassing of understanding was not done
for the sake of exhibiting translation without understanding, but rather it
was in the interest of getting an output from a partial system.

Let us look at the full text and discuss some of the problems that a
mechanical translation system will have to face. First is the German text,
and following it is an English translation.
 (1) Ein deutlich sichtbares Zeichen für die im Nervensystem
     verlaufenden Erregungen ist das Spiel der Chromatophoren
     der Cephalopoden, jener unter der Haut gelegenen, gelb,
     braun, schwarz, violett oder karminrot gefärbten Zellen,
 (5) die sich entweder zusammenziehen oder durch radiär
     ansetzende Muskeln flächenhaft ausgebreitet werden
     können. Mit ihrer Hilfe vermögen sich die Tiere bis
     zu einem gewissen Grade der Farbe des Untergrundes
     anzupassen. Die das [...] veranlassenden
(10) Reize werden nicht nur durch die Augen, [...]
     durch diese Farbzellen selbst auf [...]
     Kraken bei plötzlicher Zunahme der Lichtintensität
     ganz dunkel, auch wenn sie geblendet sind. [...]
     hängt der Zustand der Ballung oder Ausdehnung der
(15) [...] Wenn
     [...]
     [...]
     [...]
     [...]
(20) rauh ist, eine verschiedene Wirkung auf die
     [...] Eine Eledone, der alle
     Saugnäpfe entfernt worden sind, bleibt ständig
     gelbgrau, färbt sich auf Reizung nur noch dunkel
     (Steinach). Mit dem durch Lichtreiz hervorgerufenen
(25) Chromatophorenspiel pflegt allgemein Bewegung
     einherzugehen, was sich besonders bei [...]
     beobachten lässt. Auch der Trichter pflegt dabei Wasser
     auszuspritzen. Ob die Cephalopoden selbst auf Farben
     reagieren, ist nicht bekannt. Nach von Hess sollen sie
(30) [...] farbenblind [...]
     manche in der Tiefsee lebenden Tintenfische in
     verschiedenen bunten Farben erstrahlende Leuchtorgane
     besitzen, von denen man annimmt, daß sie zum
     gegenseitigen Sichauffinden der Geschlechter dienen, so
(35) scheint das zum mindesten für diese Formen für einen
     Farbensinn zu sprechen. Eingehende
     Untersuchungen über einen etwaigen Farbensinn der
     Cephalopoden sind sehr erwünscht. Neuerdings hat
     Fröhlich Unterschiede in den vom Auge abgeleiteten
(40) Aktionsströmen auf verschiedene Farbreize festgestellt,
     [...]
     Farben zu unterscheiden [...]

A clearly visible indication of the excitations that run through the
nervous system is the play of the chromatophores of the cephalopod, those
cells that lie under the skin. They are colored yellow, brown, black,
purple, or carmine red. [...] radially fastened muscles [...] be spread out
over the skin surface. With their help, the animal is able to adapt [...] to
the color of its background. The stimuli that trigger the play of the color
cells are perceived not only through the eyes [...] themselves. So it is that
cephalopods become quite dark in response to a sudden increase in light
intensity, even when they have been blinded. On the other hand, the state of
contraction or relaxation of the chromatophores is also dependent on the
suckers. When these are not grasping, the chromatophores are generally at
rest. [...] nature of the bottom, whether it is [...] influence of the
chromatophores [...] all been removed remains a yellowish grey [...] dark if
given a stimulus (Steinach). [...] elicited by light stimulus, there [...]
generally [...] which are particularly easy to observe [...] the ambulatory
funnel usually [...] color is not known. [...] one assumes these serve the
purpose of [...] then for at least these forms one [...] a sense of color.
Thorough investigation of [...] cephalopods [...] be very [...] distinguish
differences in the neural [...] out by [...] given different colored stimuli,
[...] that these animals are able to distinguish colors.


The choices that had to be made in producing the translation will be discussed
in more detail in the chapters that follow, but a cursory look at the text
reveals the following sorts of problems:

(a) SYNTACTIC AMBIGUITY: Choice of a parse can have a rather striking
influence on a translation. For example, lines 28 and 29 can be parsed in two
ways:

    Ob (die Cephalopoden) selbst auf Farben reagieren ...
    Whether (the cephalopods) even react to color ...

    Ob (die Cephalopoden selbst) auf Farben reagieren ...
    Whether (the cephalopods themselves) react to color ...


(b) PRONOUN REFERENCES are also present in the paragraph. In line 7, to
what does ihrer refer? The muscles? The chromatophores? It is our
understanding of the preceding sentences, rather than some hard and fast rule,
that determines a choice here. The phenomenon of pronoun reference is not
only limited to personal pronouns, but extends to relative pronouns such as
denen in line 33, and demonstrative adjectives used [...] and jene in lines
[...]. In addition, compounds [...] may be used to refer backward (line 27)
or to refer forward (line 41).

Going from German to English, we can often get away with translating an
ambiguous pronoun reference with an equally ambiguous English pronoun. When a
sentence needs considerable rearranging, however, this will not always work.
For example, we may have to put in a noun group where the original had a
pronoun. Note also that the limited success of this approach is partly a
by-product of the language pairs chosen. Going from English into German, where
agreement is necessary between pronoun and referent, a definite choice of
pronoun referent would be forced more often than when the direction of
translation is from German into English. So while a decision about pronoun
reference will not be forced in every case, a translating system must be
equipped to make the decision when necessary.

(c) As if pronouns were not enough, we also have a series of NOUN
REFERENCES. Line 7 uses "die Tiere" to refer back to the cephalopods of line
3, and it requires an understanding of the paragraph to determine that the two
noun groups are coreferential. If the noun groups are coreferential, we can
translate "die Tiere" simply as "the animals" (or, as "the animal," since this
sounds more natural in English). If, on the other hand, "die Tiere" refers to
animals in general, then the English would have to be just "animals."
(d) CHANGES OF TOPIC: Taking the sentence starting with "eine Eledone" on
line 21, and reading the main clause of the next sentence, it is not specified
whose chromatophores and [...] we are talking about -- the eledone's or those
of any cephalopod. There are clues, but we have to understand the context to
find them. Probably the easiest way to decide is to use the information in
the subordinate clause: we can reason that sepia is a subclass of cephalopods,
but not of eledones, so the discussion must pertain to cephalopods in general.
As in (b) and (c) above, a translation can usually get by without keeping
track of changes of topic. But situations will come up in which we have to
make something explicit that was left implicit in the original, and, for this,
knowing the topic could be crucial.
(e) Another very difficult area is WORD CHOICE. The paragraph here is a
mixture of technical and common language. In general, the more common the
word, the more varied its uses. For example, to translate the word Farbe from
line 8, the dictionary (28) says we must choose between:

    1. color, tint, hue
    2. stain, paint, dye
    3. complexion

Therefore, even if we know enough to untangle [...] pronoun and noun
reference, we still have to know more: there are fine distinctions between
different words, differences of connotation between synonyms, and issues of
what words combine best with what other words.

Note that technical language is often quite a bit easier to work with in
terms of word choice, since the meanings tend to be carefully circumscribed
and usages are limited. This might lead one to think that a highly technical
text would be a whiz for a mechanical translator. Compared to poetry, of
course, it would be. Nevertheless, even the most technical writing cannot
avoid prepositions, and prepositions (or equivalent syntactic structures) are
probably the most [...] language. Word choice, therefore, is a problem that
is always with us.

What these five problems - (a) through (e) - have in common is that they
are unified problems in name only; each particular situation presents its own
difficulties and requires its own unique solution. This, in the end, is what
makes language processing so difficult. The best that a small scale research
project can do is to handle [...] problems, with the hope that similar
techniques can be used to deal with the various other cases that occur.
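
The syntactic ambiguity in (a) can be made concrete with a small sketch. It
is not taken from the system described in this report: the names Parse,
selbst_scope and choose_parse are hypothetical, and the two readings are
simply the two bracketings of lines 28 and 29 given in (a). The point is only
that the choice of parse, which has to be supplied from outside the syntax,
fixes the English rendering of selbst.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Parse:
    """One bracketing of 'Ob die Cephalopoden selbst auf Farben reagieren'."""
    selbst_scope: str  # hypothetical label: "clause" or "noun_group"
    english: str       # English rendering that this bracketing leads to


# The two readings from (a); the scope labels are assumptions of this sketch.
PARSES = (
    Parse("clause", "Whether the cephalopods even react to color"),
    Parse("noun_group", "Whether the cephalopods themselves react to color"),
)


def choose_parse(parses, preferred_scope):
    """Return the single parse with the preferred scope for 'selbst'.

    The preference stands in for an understanding component (or the user);
    nothing in the syntax alone decides between the two readings.
    """
    matches = [p for p in parses if p.selbst_scope == preferred_scope]
    if len(matches) != 1:
        raise ValueError(f"expected exactly one parse with scope {preferred_scope!r}")
    return matches[0]


if __name__ == "__main__":
    for scope in ("clause", "noun_group"):
        print(scope, "->", choose_parse(PARSES, scope).english)
```

Keeping both parses as data and deferring the choice mirrors the situation in
the text: either rendering is grammatical, and only context can select one.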


1.4 Limitations

Before beginning in earnest, it might be a good idea to sketch out the
boundaries of the project. Some areas could have been incorporated into the
project given more time, while some are major problems in their own right. So
here is a list of what the system is not:
(a) The project was limited to written text and not speech. Thus, problems
like intonation, fixing word boundaries, and vocal differences between
speakers did not have to be touched.
(b) Also, input was assumed to be grammatically well-formed; no attempt was
made to extract a message from a form that was not fully defined for the
system. Readers interested in the problems of handling ungrammatical text are
referred to Levine (21), who has developed a program for grading German
sentences in a classroom situation.
(c) No understanding of the text is attempted. This is because the problems
of representing knowledge and using this knowledge for disambiguation are
extremely complex. A considerable amount of research is underway in this
area, and whatever results appear in the field of natural language
understanding will have direct relevance for translation. The German
interpretive grammar in the system is connected to the English generator via
an understanding by-pass routine that requires human intercession. This is
intended for demonstration purposes only.


(d) The system is a very poor conversationalist. The German grammar, which
is the most complete component in the system, is prepared to parse questions
and commands, but the rest of the system is geared strictly for declarative
text. This limitation is the result of time constraints, but also reflects
[...].

The reader by now has an idea of what the system does not do, and the
following pages will hopefully make it clear not only what the system does,
but how it goes about doing it. I have tried to give English equivalents for
German examples, so that a knowledge of German should not be necessary.
Familiarity with Winograd's system (39) and LISP, however, would be useful.


eee sane. “ae 


1.5 Organization of the System

The implementation was written in MACLISP and runs on the PDP-10
Incompatible Time-Sharing System at MIT's Project MAC. The translator has
six major components, whose functions are outlined below. It currently
occupies about 188K of core, which includes the LISP interpreter. The system
has only been run interpretively, but could be compiled. The major components
are:


The GERMAN DICTIONARY, which contains syntactic and semantic information for
the approximately 2[...] German words currently defined in the system.

The morphology routine called INPUT is the first pass in processing a text.
Given a sentence, the input routine analyzes each word into its root and
endings, then uses the dictionary to construct possible syntactic feature
lists for the combination. This syntactic information, along with semantics
picked up from the dictionary definition, is then associated with the sentence
word.

The GERMAN GRAMMAR is written in PROGRAMMAR, a LISP-embedded language designed
as a grammar-writing tool (Winograd). The grammar routines use the result of
INPUT to construct a single parse tree of the sentence. To do this, the
grammar interacts with the semantic component in an attempt to limit syntactic
ambiguities by the limited semantic case checking currently implemented.
Where a syntactic ambiguity cannot be eliminated immediately, a choice is made
and backup is used if necessary.

When the grammar has parsed a section of the sentence, it calls the SEMANTIC
COMPONENT for initial semantic processing. Semantics constructs as many
representations as possible. If this number is zero, syntax is asked to
reparse the section. If the number is non-zero, all possible interpretations
are carried forward. These semantic ambiguities must be eliminated later in
the sentence, either by the semantic component or by user intervention, so
that only a single representation is sent to the generator.

Associated with the semantic structure are the routines of the ENGLISH
DICTIONARY which, when executed, supply a set of relevant English words.
Using these words along with other information available from the semantic
representation, the ENGLISH GENERATOR can then construct an English equivalent
for the input sentence. Output is not necessarily sentence for sentence,
since the translator does have a limited facility for breaking down long
German sentences into two or more short English ones.
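
The interplay between syntax and semantics described for the SEMANTIC
COMPONENT can be sketched as a small control loop. This is an illustration in
Python, not the MACLISP code of the system; the function names, and the
representation of parses and semantic readings as plain Python values, are
assumptions made for the sketch.

```python
def interpret_section(candidate_parses, build_readings):
    """Return (parse, readings) for the first parse that semantics accepts.

    candidate_parses: parses of one section, in the order syntax proposes them.
    build_readings:   hypothetical stand-in for the semantic component; it maps
                      a parse to a list of semantic representations.
    """
    for parse in candidate_parses:
        readings = build_readings(parse)
        if readings:
            # Non-zero: every interpretation is carried forward.
            return parse, readings
        # Zero: syntax is asked to reparse, so the next candidate is tried.
    return None, []


def select_for_generator(readings, pick):
    """Reduce the surviving readings to the single one sent to the generator.

    pick stands in for later semantic filtering or for user intervention.
    """
    if not readings:
        raise ValueError("no semantic representation available")
    if len(readings) == 1:
        return readings[0]
    chosen = pick(readings)
    if chosen not in readings:
        raise ValueError("pick must return one of the candidate readings")
    return chosen
```

With a toy semantic component that rejects the first parse and returns two
readings for the second, interpret_section yields the second parse with both
readings, and select_for_generator then narrows them to one.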

The sections that follow deal with the system in approximately this
order, with the exception that information about the dictionaries is
distributed throughout as it becomes relevant. Chapters 2, 4, and 6 discuss
the sections of the model that have been implemented, and chapters 3 and 5
discuss some issues in representing knowledge and understanding natural
language.


Chapter 2 -- The German Interpretive Grammar


2.1 The Underlying Theory

The analysis of German was based on the theory of systemic grammar
developed by M. A. K. Halliday. Some of Halliday's ideas on discourse
have also been used, but discussion of this is deferred to section 4.9. In
describing systemic grammar I must necessarily be brief, and the reader who
wants an in-depth treatment is referred to Halliday, [...] and Winograd.

The central precept of Halliday's theory is that language is structured
to convey meaning. That is, any analysis of language cannot, and should not,
divorce form from this single overriding function. The job of conveying
meaning is delegated among different syntactic units, of which there are three
ranks: clause, group, and word. At the group level, there are noun groups,
preposition groups, and adjective groups (I follow Hudson here in excluding
the verb group; see section 2.6.1). The mechanism of rankshift permits one
unit to assume the role of another, for example, a clause may take the place
of a noun group. Associated with each unit is a network of features, with a
set of mutually exclusive features known as a system. These networks specify
the choices available in the language. We move from one level to another in
the network by satisfying the entry condition of a system. Each choice made
may set up certain constraints on the surface structure of an utterance by
means of realization rules associated with it. For example, the decision to
put an adverbial (a preposition group or an adjective group) in the first
position of the sentence in German means that subject and finite (i.e.
conjugated) verb will have inverted order. An important point about
Halliday's theory is that the choices at a given stage are not ordered with
respect to each other. Unlike transformational grammar, we are not deriving
one fully-specified structure from another. Instead, as we proceed through
the networks we accumulate [...], until in the end the surface structure of an
utterance [...].
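
The realization rule mentioned above, inversion of subject and finite verb
after a fronted adverbial, can be written out as a small sketch. The function
and its arguments are hypothetical and cover only a declarative main clause;
the words in the usage note are taken from the semi-colon sentences used as
examples later in this chapter.

```python
def realize_main_clause(subject, finite_verb, rest=(), fronted_adverbial=None):
    """Order the constituents of a German declarative main clause.

    Without a fronted adverbial the subject comes first. With one, the
    adverbial takes first position and subject and finite verb are inverted.
    Constituents are plain strings, which is an assumption of this sketch.
    """
    if fronted_adverbial is None:
        constituents = [subject, finite_verb, *rest]
    else:
        constituents = [fronted_adverbial, finite_verb, subject, *rest]
    return " ".join(constituents)
```

Called with the subject "ein Semi-kolon" and the finite verb "genügt", the
function leaves the subject first; adding the fronted adverbial "oft" moves the
verb ahead of the subject. The choice of a feature (adverbial first) thus shows
up only as a constraint on order, not as a transformation of a finished
sentence.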


A network for the German clauses handled in the system appears in Figure
2.2, along with example sentences. I have tried to keep the presentation
close to that of Winograd, so that a reader familiar with his English charts
should have no difficulty here. The notation used in the network is shown in
Figure 2.1. In Figure 2.1a, the vertical line indicates a system, and the
horizontal line on the left specifies an entry condition. The system may be
labeled, as it is here, by writing the label above the entry condition. If
two independent systems share the same entry condition, this is indicated as
in Figure 2.1b. If there is more than one entry condition associated with a
given system, it may only be necessary to satisfy one of them (Fig. 2.1c).
Features indicated by dashes (---) are unmarked, and are defined merely as
the absence of the other features in the system.

[Figure 2.1: notation for system networks, panels (a) through (c)]
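
The notation just described maps naturally onto a small data structure. The
sketch below is an illustration only: the class names and the feature names
(major, secondary, declarative, and so on) are hypothetical and are not taken
from Figure 2.2. A system is a set of mutually exclusive features together
with a list of alternative entry conditions, any one of which opens the
system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class System:
    name: str
    features: tuple    # mutually exclusive choices
    entry: tuple = ()  # alternative entry conditions; empty means always open


class FeatureSelection:
    """Accumulates one feature per system, respecting entry conditions."""

    def __init__(self, systems):
        self.systems = {s.name: s for s in systems}
        self.chosen = {}  # system name -> selected feature

    def selected(self):
        return set(self.chosen.values())

    def choose(self, system_name, feature):
        system = self.systems[system_name]
        if feature not in system.features:
            raise ValueError(f"{feature!r} is not a feature of {system_name}")
        if system_name in self.chosen:
            raise ValueError(f"{system_name} already has a feature selected")
        # Satisfying any one of the listed entry conditions is enough.
        if system.entry and not self.selected().intersection(system.entry):
            raise ValueError(f"entry condition for {system_name} not satisfied")
        self.chosen[system_name] = feature
        return self


# Hypothetical fragment of a clause network.
CLAUSE_SYSTEMS = (
    System("clause-type", ("major", "secondary")),
    System("mood", ("declarative", "question", "imperative"), entry=("major",)),
)
```

Selecting a mood before the clause type raises an error, because the entry
condition of the mood system has not been met; selecting "major" first opens
it. Features accumulate in a set, which reflects the point that choices at a
given stage are not ordered with respect to each other.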


[Figure 2.2: network of features for the German clause]

The following example sentences correspond to the numbered features on the
previous page. The translations are intended to give a rough idea of the
structures involved.


(1) Ein Semi-kolon genügt.
    A semi-colon suffices.
(2) Oft genügt ein Semi-kolon.
    Often, suffices a semi-colon.
(3) Sie würden eigentlich nur auffallen, wenn sie ausfallen.
    They would actually only be noticed, when they are missing.
(4) Um den Hund zu retten, stürzte sich der Bauer ins Wasser.
    To save the dog, the farmer plunged into the water.
(5) Alte Lieder singend, kamen die Kinder hinein.
    The children came in singing old songs.
(6) Am Pflock angebunden, ächzte ein verlassenes Boot im Wellenschlage.
    Bound to a post, groaned an abandoned boat in the waves.
(7) Eben bekommen wir die Nachricht, daß der Zug noch nicht abgefahren sei.
    We have just received news that the train has not left yet.
(8) Eben bekommen wir die Nachricht, der Zug sei noch nicht abgefahren.
    We have just received news the train has not left yet.
(9) Ich hatte die Gelegenheit, Berlin zu besuchen.
    I had the opportunity to visit Berlin.
(10) Eine Eledone, der alle Saugnäpfe entfernt worden sind ...
     An eledone, whose suckers have all been removed ...
(11) Mit dem durch Lichtreiz hervorgerufenen Chromatophorenspiel ...
     Along with the by light stimulus elicited play of the chromatophores ...
(12) Ein deutlich sichtbares Zeichen für die im Nervensystem
     verlaufenden Erregungen ...
     A clearly visible indication of the through the nervous system running
     excitations ...
(13) [...]
(14) [...]

(15) Ich bitte dich, mich morgen zu besuchen.
     I request you to visit me tomorrow.

2.2 A Definition for Syntax
Distinguishing syntax from semantics is a slippery business, especially
when I have just claimed to subscribe to the idea that the structure of an
utterance is intimately entwined with the meaning it conveys. Still, there
seems to be some life left in this very old distinction, so let me make an
attempt to delimit a useful boundary. Traditionally, syntax has dealt with
the form of phrases and semantics with their content, or meaning. This
definition is a start, but it is too much of a simplification, since it
ignores the effects that semantic content can have on form. In German, for
example, the default adverbial ordering is time before manner before place.
Here, time, manner and place look like semantic categories, but their presence
has a direct effect on the surface structure of the utterance.
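
The default ordering just mentioned shows how a semantic category can drive
surface order. A minimal sketch, assuming each adverbial arrives already
labeled with one of the three categories (the labeling itself is the hard part
and is not attempted here); the function name and the example phrases are
hypothetical.

```python
DEFAULT_ORDER = {"time": 0, "manner": 1, "place": 2}


def order_adverbials(adverbials):
    """Sort (phrase, category) pairs into time, manner, place order.

    sorted() is stable, so adverbials of the same category keep the
    order in which they were given.
    """
    ranked = sorted(adverbials, key=lambda pair: DEFAULT_ORDER[pair[1]])
    return [phrase for phrase, _category in ranked]
```

Given "nach Berlin" (place), "heute" (time) and "mit dem Zug" (manner) in that
order, the function returns them as "heute", "mit dem Zug", "nach Berlin". The
sort key is a semantic label, yet its only effect is on word order.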
red Pram hi ged: ; 
The whole situation becomes even more complex when one considers that
most choices about the form of an utterance have, in terms of the
traditional definition, semantic implications. It is true that there are
choices that seem to be motivated by syntactic rules alone, such as the fact
that the preposition ohne takes the accusative case in German, while one like
bei takes the dative. (Here, and in what follows, I am using "rule" in the
broad sense, to mean a regularity in language.) More common, however, is a
situation such as plural formation. In German, there are several ways to form
the plural for nouns, with different nouns taking [...] etc. The choice
between endings is a syntactic or morphological one (although many words are
fairly idiosyncratic), but at the same time, the addition of a plural ending
reflects a distinction between one and more than one that is basically
semantic, in the traditional sense.


To firm up our definition, one way to go from here would be to identify
the syntactic aspect of language with rules that govern sentence formation and
say that semantics is the great morass beyond. This definition begs the
question, however, since the content of language could be just as
rule-governed as its form. For example, there might be a rule to express the
fact that "The blue horse is blue" is redundant, or the fact that "The blue
horse is green" is contradictory. To call such phenomena syntactic because
they are rule-governed is fine; the only trouble is, however, that then it is
not clear what, if anything, is left for semantics.

In building the system, my decision whether a given aspect of language
was syntactic or semantic was based on the following definition for syntax.
Syntactic rules give:

(i) a minimal specification of word order

(ii) a specification of the morphemic and lexical tags that explicitly
mark the relationships between words and the relations within and between
syntactic structures

(iii) a specification of punctuation

All the rest goes into semantics. (I am ignoring the semantic/pragmatic
distinction, since I find it even more difficult to draw than the
syntactic/semantic one.) Note that English syntactic rules rely very heavily
on word order, while German strikes more of a balance between word order and
morphology (i.e. case distinctions). With more reliance on explicit tags for
its syntactic distinctions, German has a concomitantly higher degree of word
order flexibility than does English. With respect to (i), the definition of
minimal can really only be given in the form of a list of syntactic rules.
The closest thing I have to this now is the set of word order constraints
built into the interpretive and generative grammars, but of course these are
not complete. To give you an idea of what I have in mind for (i), I would
consider something like verb ordering in end-order a syntactic rule. On the
other hand, such things as adverbial ordering (mentioned above) and adjective
ordering before a noun ("the big red block" as opposed to "the red big
block") are orderings beyond the minimal, and are based on semantic criteria.
Relating this definition of syntax to systemic grammar, I would call a
feature syntactic if its associated realization rules all fall into the three
categories given above. I do realize that the definition of syntax given here
is a sketchy one, but it should be enough to give a sense of the criteria that
were used for deciding which regularities should be reflected in the syntactic
component (both interpretive and generative) and which in the semantic
component. I should add here that these criteria were ignored occasionally for
the sake of efficiency, so that the interpretive grammar program is far from a
pure linguistic statement.

2.3 Word Classes

While I will not discuss the treatment of clauses and groups in detail, I
would like to look at classification at the word level. A description of the
different parts of speech used for German words follows, while a summary of
the features used in definitions can be found in Appendix A.


2.3.1 Adjectives and Adverbs
A distinction between adjectives and adverbs is especially difficult to
draw in German. Where in English we have "Fido sings well" and "Fido is
good," the equivalent in German is "Fido singt gut" and "Fido ist gut." To
get an idea of where the distinction between adjectives and adverbs is to be
drawn - or whether such a distinction even should be drawn - we need to know
the functions that these units will have. Five likely-looking functions would
be:

(1) attribute of a noun (declined), as in "der blaue Engel" = "the blue
angel"

(2) complement, as in "Die Eledone ist klug" = "The eledone is smart"

(3) postnominal modifier, as in "die Strasse links" = "the street on the
left"

(4) modifier of a verb, as in "Er fährt schnell" = "He drives fast"

(5) modifier of the modifier types (1) through (5), as in "eine sehr
gute Gelegenheit" = "a very good opportunity"

Especially when dealing with German, it may be better to have a single
adjective-adverb category called "modifiers" and avoid the problem of where to
draw the line. A highly conservative line, however, is drawn for the system.
Basically, an adverb is a member of the class that performs function (5)
above, but there are several conditions that further constrain adverb
membership.

One criterion for an adverb that will be used is that it may never take
adjective endings, that is, perform function (1). Further, an adverb may
never have a comparative or superlative form. Both criteria must be present
at once, since some perfectly good words that take attributive endings do not
have comparative or superlative forms, and there are words that can perform
both functions (1) and (5). An example of the latter is deutlich in "ein
deutlich sichtbares Zeichen" ("a clearly visible indication") and "ein
deutlicher Satz" ("a meaningful sentence"). Candidates for adverb, then, are
words like so, sehr, zu, and a few others.
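
The conservative adverb test can be restated as a small predicate. The following is only an illustrative sketch in Python, not the system's LISP code; the Word record, its field names, and the function sets given to the two sample words are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """Hypothetical lexical record; the field names are illustrative only."""
    form: str
    functions: frozenset      # subset of the functions (1) through (5)
    has_comparison: bool      # has a comparative or superlative form


def is_adverb(word: Word) -> bool:
    """Adverb: performs function (5), never function (1), and has no comparison."""
    return (5 in word.functions
            and 1 not in word.functions
            and not word.has_comparison)


# Sample entries (assumed values, chosen to mirror the discussion).
sehr = Word("sehr", frozenset({5}), has_comparison=False)
deutlich = Word("deutlich", frozenset({1, 5}), has_comparison=True)
```

Under these assumed entries, sehr passes the test while deutlich is left in the adjective class, since it can take attributive endings.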

It must be obvious at this point that adverbs are a rather select group,
and that the class "adjective" covers a lot of ground. To alleviate this
problem, the system uses features that correspond to the functions given
above. Adjectives are either ATTR (attributive, function 1), COMPL
(complement, function 2), POSTNOM (postnominal, function 3), or RELMOD
(relation modifiers, functions 4 and 5). Some adjectives may take an
object as a verb does, for example, "Das ist nicht der Mühe wert" ("That's not
worth the effort"). If this is possible, then the case of the object must be
specified in the feature list of the adjective.

ATTRibutive adjectives are DECLined, which means that they take endings,
which are specified for case, gender and number of the main noun in the noun
group. Adjectives are also said to have a STRONG, WEAK, or MIXED declension,
depending on the determiner in the noun group. STRONG adjective endings are
used when an adjective is in first position in the noun group, for example,
"guter Wein" ("good wine") or when the adjective parallels another strongly
declined adjective ("feiner lebhafter Wein" / "fine, lively wine"). WEAK
endings appear on adjectives following a determiner with strong endings
("der gute Wein" = "the good wine"), and MIXED endings follow indefinite
determiners, since these may or may not carry case and gender information:
"einen guten Wein" (accusative), but "ein guter Wein" (nominative).

I should note that calling words that fulfill functions (1), (2), and (4)
adjectives is in agreement with the analysis of Glinz (8), although my adverb
category is more tightly constrained.
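
The choice among the three declensions can be sketched as a lookup on what precedes the adjective in the noun group. This Python fragment is an illustration only; the function name and the labels "STRONG-ENDING" and "INDEF" for the determiner kinds are assumptions, and the case of one strongly declined adjective following another is reduced to "no determiner".

```python
def declension(determiner_kind):
    """Return the declension type of an attributive adjective.

    determiner_kind is None when no determiner precedes the adjective,
    "STRONG-ENDING" for a determiner that itself carries strong endings
    (der, die, das, ...), and "INDEF" for an indefinite determiner.
    """
    if determiner_kind is None:
        return "STRONG"          # guter Wein
    if determiner_kind == "STRONG-ENDING":
        return "WEAK"            # der gute Wein
    if determiner_kind == "INDEF":
        return "MIXED"           # ein guter Wein / einen guten Wein
    raise ValueError(f"unknown determiner kind: {determiner_kind!r}")
```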

2.3.2 Binders
Examples of the class BINDER are dass, nachdem, seit, wann, etc. Binders
appear in the first place of a subordinate clause. Wenn/dann combinations
(the equivalent of if/then) are not handled in the system, but they would
need a special tag in the dictionary.

2.3.3 Cardinals
Cardinals like zwei, drei, etc. occur in noun groups, and right now the
parser assumes that they will not be declined.


2.3.4 Conjunctions
The coordinating conjunctions - und, aber, oder, sondern and denn - are
not given syntactic features, but instead are defined using a special
function. In the system, the parsing of conjunctions is done using a program
taken from Winograd's system with only slight modification. See Winograd (39).

2.3.5 Determiners

Determiners (DET) are a fairly diverse class, with the common property
that they can, and usually do, occupy the first position in a noun group.
Those with the feature DEF - the definites der, die, das, des, etc. - carry
specification of gender, number, and case. The indefinites ein and
kein are distinguished with respect to number (SING or PLUR) and take endings
that indicate gender and case. Possessive determiners (POSS) like mein, dein
and sein specify person, gender and number from their associated pronouns,
then take endings for gender, number, and case of the main noun in the noun
group. Determiners that take "der" endings are called demonstrative
adjectives here (DEMADJ) and include dies-, jen-, etc. There are also the
interrogative determiners (INTER) welch- and wessen. Welch- is declined for
gender, number and case, while wessen takes no endings and carries no such
specification.

2.3.6 Interjections

These words, like aber and ja, may appear between the subject and verb in
regular or inverted order clauses.

2.3.7 Nouns

Nouns may be either MASS, COUNT, or proper (PROPN), which are relevant to
whether a determiner may be used. They may be STRONG or WEAK depending on
what endings are taken. For STRONG nouns, the plural and genitive endings are
given in the definition. The genitive ending is necessary here because there
is not enough morphological information in the input routine to derive it.
To do this reliably, INPUT would have to take into consideration the number of
syllables in the word and the nature of its terminal letters.

2.3.8 Participles

The PARTiciple class consists of past participles (PAST) - like
geschwommen (swum) - and present participles (PRES) like schwimmend
(swimming). Participles are not entered in the dictionary explicitly, but the
underlying verbs are. It is the responsibility of the morphology program,
discussed below, to transform a verb definition into one for a participle. If
a participle is DECLined, then it takes the same endings as an adjective.

2.3.9 Prepositions

Happily, this is a simple part of speech. Prepositions
are either pre-fixed (PRE) as in "zu Hause," or post-fixed (POST) as in "dem
Haus gegenüber." The cases they govern are either dative (DAT), accusative
(ACC), genitive (GEN) or mixed (MIXED), i.e. either dative or accusative.

2.3.10 Pronouns

As in the case of determiners, there are a number of varieties of
pronouns (PRON). Most common are the personal pronouns (PERS), which is
actually a poor choice of terminology here, since in German they are
frequently used to refer to inanimate objects. Personal pronouns are
specified for person, case, number and gender (if third person). There is
also a group of personal pronouns distinguished by the feature RELMOD. RELMOD
pronouns like das, was, and da (when da is compounded with a preposition) may
refer to whole relations or statements, rather than just to other noun groups.
The interrogative pronouns (INTER) like wer and was are specified for case,
while the possessive pronouns (POSS) - like meins - are marked with gender,
number, and person of the pronoun part, then endings indicate gender, case,
and number in the pronoun's role in the noun group. Abstract pronouns
(ABSTRACT) form a further variety. The relative pronouns (REL) - der,
die, das, deren, dessen, welche, etc. - look a lot like definite determiners
and carry gender, number, and case information. Finally, DEFinite and
INDEFinite determiners may also be used as pronouns, and they carry the
features PRON DEM (for demonstrative pronoun).

2.3.11 Quantifiers
This class is used for words that appear in first position of the noun
group but can coexist with determiners, e.g. "all die Menschen" ("all the
people") and "selbst ein Cephalopod" ("even a cephalopod").

2.3.12 Separable Prefixes

In general, these coincide with prepositions, and their usage corresponds
to the English particle, as in "I'll call them up."
Separable prefixes (SEPPR) appear in the dictionary as separate words in terms
of syntactic features, but they are not given semantic definitions independent
of their associated verb.

2.3.13 Verbs

Verbs come in the following varieties: First, they are either main verbs
(MVB), auxiliaries (AUX), or modals (MODAL). The auxiliaries are haben, sein,
and werden, and modals are those that require an infinitive. Modals
include können, müssen, sollen, etc.

If a verb is an MVB, it may be PLAIN, +SEPPR or -SEPPR. +SEPPR means that
the verb form happens to have its separable prefix attached (abhängt), while
-SEPPR means that the separable prefix is somewhere to the right (hängt
davon ab). PLAIN verbs are all the rest. A verb definition also specifies
whether the verb is regular, irregular, mixed, or takes an umlaut in the
second and third person singular (REG, IRR, MIXED, and UL, respectively).
Verbs with inseparable prefixes, like besitzen, are marked INSEPPR. This
information is for the benefit of the morphology routines.

Another sort of information is the type of objects that a verb may take,
which will determine the transitivity of the clause. Instead of using
features like TRANSitive, I have specified transitivity in terms of syntactic
case for noun groups, and other abbreviations for other objects. For example,
A+D is the feature of a verb that takes an accusative and a dative object (not
necessarily in this order). If a preposition is required by the verb, then
its transitivity is given as "P", while if any adverbial is required, "E" is
used (for no particular reason). An intransitive verb is still marked "I," and
a separate code is used for dative pronouns used reflexively. "Z" is used if a
rankshifted zu clause is required, as in:
    Auch der Trichter ...
    At the same time, the ambulatory funnel ...
I have not used categories like "required location," since this seems to be
part of the broader phenomenon of semantic relation between arguments of the
verb, which is handled by the semantic component. (In this case, a selection
restriction would be used.) A verb with a required location, then, is marked
with E, P, A+P, etc. as relevant. For a complete list of the transitivity
types used in the system, see Appendix A.
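
To make the notation concrete, a transitivity feature such as A+D can be expanded mechanically into the objects a verb requires. The sketch below is in Python rather than the system's LISP; the table holds only codes named in the text (A and D being inferred from the A+D example), and the function name and the English glosses are assumptions.

```python
# Codes mentioned in the text; the complete inventory is in Appendix A.
OBJECT_CODES = {
    "A": "accusative object",
    "D": "dative object",
    "P": "prepositional object",
    "E": "adverbial",
}


def required_objects(transitivity):
    """Expand a transitivity feature such as "A+D" into its required objects."""
    if transitivity == "I":       # intransitive: nothing is required
        return []
    return [OBJECT_CODES[code] for code in transitivity.split("+")]
```

For instance, required_objects("A+P") lists an accusative object and a prepositional object, which is how a verb with a required location might be marked.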


2.4 PROGRAMMAR

A network like Figure 2.2 gives a description of structures possible in a
language, but does not specify how we go about relating a given sentence to
this descriptive information. One solution to this problem was given by
Winograd, and for the interpretive grammar, I followed his approach quite
closely. Winograd's approach is not the only way, of course, to use the
information in the networks to guide parsing, and I will come back to this
issue in section 2.7.2. The German interpretive grammar was written using
PROGRAMMAR, a LISP-embedded language documented in Winograd (38). In brief,
PROGRAMMAR provides facilities for constructing, inspecting, and manipulating
a parse tree. The basic function is (PARSE <features>) which inspects the
input sentence and tries to add the specified node to the tree. A group of
feature-examining functions (NQ, CQ, ISQ, etc.) allow inspection of the
sentence or tree for particular features, and another group (among them, F,
FQ, R, RQ, ADD-F-NODE, and REMOVE-F-NODE) allow changes to be made in the
features of a node. For moving around the parse tree there is the function
MOVE-PT, while MOVE-PTW can be used to move around the input sentence. If a
parse turns out to be incorrect, the backup functions POP and POPTO may be
used to remove particular nodes from the tree. The basic statement type is
the branch statement:

    (: (<function or variable>) <label-1> <label-2> <label-3>)

Control goes to label-1 if the function or variable evaluates to non-nil, to
label-2 if the value is nil, and to label-3 if the end of the sentence has
been reached. Label-3 is optional.
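
The three-way choice made by a branch statement can be mimicked in a few lines. This Python sketch is not PROGRAMMAR itself: the function name and the at_end flag are inventions for illustration, Python truthiness stands in for the LISP nil/non-nil distinction, and the sketch assumes that label-3, when supplied, takes priority once the end of the sentence has been reached (the description above does not settle that point).

```python
def branch(test, label_1, label_2, label_3=None, at_end=False):
    """Choose the next label in the manner of a branch statement.

    test    -- zero-argument callable standing in for the function or variable
    at_end  -- True when the end of the sentence has been reached
    """
    result = test()                      # the test is always evaluated
    if at_end and label_3 is not None:   # assumed priority of label-3
        return label_3
    return label_1 if result else label_2
```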
Since PROGRAMMAR was designed to handle English, some changes and
extensions were necessary for processing German. These involved the addition
of another syntactic level, that of the phrase, the expansion of the apparatus
that assigns feature lists to words and nodes, and a mechanism for handling
partial information as it is accumulated. The rest of this section will
discuss these additions to PROGRAMMAR.


As in Winograd's system, the actual parsing is done by the syntactic
specialists called by the PARSE function. The specialists correspond to
units, so we have CLAUSE, NG, ADJG, and PREPG. In addition, the German parser
contains routines that handle phrases, that is, constituents that often
appear together and for which subroutinization makes sense, but which do not
enjoy the theoretical status of a group or clause. The basic difference
between the treatment of groups and phrases is that a node is not established
for a phrase when its routine is entered, but a group always has its own node
marked with its name and features. Phrase programs are used to handle things
like verbs and objects of verbs. The components of these phrases are
interrelated, so we would like to handle them together. They are not full
units of the grammar, however, since the components do not have to remain
contiguous under all circumstances. For example, in German a direct object
may appear before the finite verb, while the indirect object of the same
sentence comes after the verb. No changes had to be made in PROGRAMMAR in
order to write phrase routines, since they are treated like ordinary
subroutines. Phrase programs look like group programs except that they do not
use the reserved tags RETURN and FAIL. Modifying PROGRAMMAR to permit use of
these would not have been particularly difficult, and for uniformity the
change should probably have been made.

The case-gender-number combinatorics of German (described below in the
morphology section) made it necessary to switch from a single list of
features associated with each input word in the original PROGRAMMAR to
multiple feature lists. Each possible usage of a word, then, is expressed as
a different feature list. To handle these, a few simple changes were made in
PROGRAMMAR. First, most functions now handle a list of features with an
implied "and" linking the entries. Thus (NQ MASC SING) checks to see if some
feature list associated with the next word has both MASC and SING properties.
It would fail, for example, if (NOUN MASC PLUR) and (NOUN FEM SING) were the
feature sets in question.
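
The behaviour described for NQ over multiple feature lists amounts to an "any list satisfies all features" test. The Python sketch below illustrates that reading only; the lower-case name nq and the tuple representation of feature lists are assumptions, not the PROGRAMMAR implementation.

```python
def nq(feature_lists, *required):
    """True if some single feature list contains every required feature."""
    return any(all(feature in features for feature in required)
               for features in feature_lists)


# The two feature sets from the example in the text.
next_word = [("NOUN", "MASC", "PLUR"), ("NOUN", "FEM", "SING")]
```

Here nq(next_word, "MASC", "SING") is false, because MASC and SING never occur in the same list, while nq(next_word, "FEM", "SING") is true.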

Two other additions to PROGRAMMAR, FIX and WONEX, were also motivated by
the proliferation of case, number, and gender possibilities. They are used
for dealing with partial information and are discussed further in section
2.6.2. To discuss the interpretive process, let us start with the morphology
program, since this is the first stop made by the input sentence.


2.5.1 Analyzing Morphological Tags
Given a word, the job of the morphology component is to determine its
root and then make up a list of syntactic features from information
associated with the root and endings. Morphology finds its information in the
German dictionary, which contains both roots and endings. Syntactic
information for a root is listed under the keyword FEATURES, where there is
one feature set for each possible usage. Thus for breit (wide), which might
appear in contexts like "die breiten Strassen" ("the broad streets") and "Die
Chromatophoren breiten sich aus," ("The chromatophores spread out") the
syntactic part of the dictionary entry looks like:

    (DEFS BREIT FEATURES (
        (VERB REG -SEPPR AUS)
        (ADJ ATTR COMP SUP)))

In other words, breit may be either a regular verb that has the separable
prefix aus, or an attributive (that is, prenominal) adjective that can form a
comparative or a superlative. Actually, such dictionary entries contain more
information, and the complete specification may be found in Appendix A.

Morphology finds its ending information as a list of feature sets
indexed by ENDING, and then the part of speech. Endings are marked by ">", so
there is no chance of confusing the ending >ES with the word es. (This
distinction is more for the benefit of the user, since the ENDING index is
enough to keep the system on the right track.) Part of the information
associated with >ES looks like:

    (DEFS >ES ENDING (
        (PRON* ABSTRACT* GEN SING)
        (NOUN* STRONG* GENES* GEN SING)
        (ADJ* ATTR* DECL STRONG NEUT NOM SING)
        (ADJ* ATTR* DECL STRONG NEUT ACC SING)))

There are two main routines in morphology, INPUT and TRY.


2.5.2 The Routine INPUT
INPUT is the German equivalent of Winograd's morphology analyzer. It
starts at the end of a word, making successive cuts until all ending
possibilities have been tried. With German we get involved in compound
endings, for example when the present participle is used as an adjective, as
in veranlassende (causing) = veranlass + end + e. In addition, there are some
prefixes to consider, as is the case with ge in the past participles, and
there are also some infixes, i.e. ge and zu, as in

    ausgearbeitet (worked out) = aus + ge + arbeit + et
    anzuschauen (to look at) = an + zu + schauen

The input program handles all regularly occurring morphological changes and
also takes care of some non-standard situations like the addition of an umlaut
to the verb lassen in the third person singular (lässt). Cases that are not
handled by INPUT's ending analyzer appear directly in the dictionary. These
include the various incarnations of sein (to be), past participles with a
vowel change like gekrochen (crept), and nouns with unusual plurals like
Sepia (pl. Sepien).
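
The successive cuts made from the end of a word can be pictured as a small recursive generator. This is a Python sketch under stated assumptions, not the INPUT routine: the ending inventory is a toy list, prefixes and infixes are ignored, and no dictionary lookup is attempted (that step belongs to TRY).

```python
ENDINGS = ("end", "en", "er", "es", "et", "e", "t")   # toy inventory (assumed)


def splits(word, endings=ENDINGS):
    """Yield (stem, [ending, ...]) pairs, peeling endings off the right end."""
    yield word, []
    for ending in endings:
        if word.endswith(ending) and len(word) > len(ending):
            # Cut this ending, then keep cutting the remaining stem.
            for stem, inner in splits(word[:-len(ending)], endings):
                yield stem, inner + [ending]
```

Each candidate pair would then be handed on for checking against the dictionary; most candidates are expected to be rejected there.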


2.5.3 The Routine TRY
Once INPUT thinks it has a likely split, it sends the stem and ending
list off to TRY. The first step here is to check to see if the proposed root
does indeed appear in the dictionary. If so, we pick up its syntactic
features and hold on to them. We also pick up the features associated with the
different endings. It is at this point that we begin to notice some special
problems associated with German morphological processing. Since an adjective
ending may vary in case, gender, and number, and may be strong or
weak, the combinatorics begin to be a problem. And this accounting does not
even take into consideration verbs, nouns, or participles, some of whose
endings may coincide with adjective endings. For this reason, TRY makes a
first pass to determine the parts of speech possible for the root and then
looks at the endings to see if they form a permissible pattern, given the part
of speech. For example, breites (broad) need not be tried as a verb, since no
German verb is formed by adding es to a root. Here, only the possibilities
for an adjective need be considered, and since the endings lists are all
indexed by parts of speech, it is a simple matter to pick out the relevant
possibilities.

Having narrowed the field somewhat, the next step is to call the routine
MERGE. MERGE moves through the endings lists, starting with the list for the
last ending and working back to the list for the first. Its job is to compound
information and eliminate bad combinations. To do this, MERGE needs to know
which part of the ending possibility list is required information and which is
new. In the system, stars denote information that must be present in the
preceding feature list and unmarked features are simply added. As an example
of the matching and compounding done in MERGE, take liegendes (lying), which
is a participle:

    lieg:  (VERB IRR MVB)
    -end:  (VERB* PRESP)
    -es:   (PART PRESP* STRONG NEUT NOM SING)
           (PART PRESP* STRONG NEUT ACC SING)

Matching on starred elements, and adding the unstarred information, we get:

    (PART PRESP STRONG NEUT NOM SING IRR MVB)
    (PART PRESP STRONG NEUT ACC SING IRR MVB)

These are simplified versions of the two feature sets that will be associated
with the word liegendes. A special action is taken here by the routine TRY
because we are dealing with a change in part of speech. For this special
case, the feature VERB was removed after PART was added, to prevent the part
of speech description from becoming ambiguous in the final feature sets. This
deletion is done by a simple check in TRY, since a part of speech change
occurs in only a few cases.
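
The star convention can be tried out on this participle example. The Python sketch below is an illustration rather than the MERGE routine: it folds the endings in from the root outward instead of working back from the last ending, it writes the star as a trailing "*" on a string, and it performs the VERB deletion inside the same function, whereas the system leaves that check to TRY.

```python
def merge(preceding, ending):
    """Combine one ending feature set with the feature list that precedes it.

    Features ending in "*" must already be present in `preceding`;
    unstarred features are simply added. Returns None on a mismatch.
    """
    required = [f[:-1] for f in ending if f.endswith("*")]
    if any(f not in preceding for f in required):
        return None
    added = [f[:-1] if f.endswith("*") else f for f in ending]
    result = added + [f for f in preceding if f not in added]
    if "PART" in result and "VERB" in result:   # part-of-speech change
        result.remove("VERB")
    return result


root = ["VERB", "IRR", "MVB"]
stem = merge(root, ["VERB*", "PRESP"])
final = merge(stem, ["PART", "PRESP*", "STRONG", "NEUT", "NOM", "SING"])
```

A feature set whose starred requirement is not met, such as an ending demanding VERB after an adjective root, yields None and is thereby eliminated.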
Each call of TRY by INPUT may add feature sets to the syntactic feature
list, so that in fact a word may be divided in several ways. To keep the
possibilities straight, a root list is also constructed, with an entry that
corresponds to each feature set.


2.5.4 Special Features of INPUT

If INPUT is not successful with its initial ending analysis, it looks
for a compound word. Compound words appear frequently in German; often where
English would use a classifier plus a noun, German uses a compound. The
method used is a brute force one. The word is split into two parts and each
of these is fed to a recursive call of INPUT. If both sections turn out to be
words, we construct a feature list. If not, then we move over a letter, make
another split, and try again. All sorts of refinements, e.g. only making
splits between syllables, are possibilities here, but nothing like this has
been done. Right now, the system handles only compounds made from two
components, but the code could be generalized fairly easily. The compound
analyzer will accept noun + noun pairs like Chromatophorenspiel ("play of the
chromatophores"), verb + noun pairs like Leuchtorgan ("light organ"), and
pronoun + infinitive used as a noun, as in Sichfinden ("finding each
other").
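
The brute force search for a two-part compound is easy to sketch. In the Python fragment below, which is an illustration and not the system's code, the is_word argument stands in for the recursive call of INPUT, and the two-entry lexicon is a toy assumption.

```python
def split_compound(word, is_word):
    """Return the first (left, right) split whose halves are both words, else None."""
    for cut in range(1, len(word)):          # move over one letter at a time
        left, right = word[:cut], word[cut:]
        if is_word(left) and is_word(right):
            return left, right
    return None


LEXICON = {"leucht", "organ"}   # toy stand-in for the dictionary (assumed)
```

With this lexicon, split_compound("leuchtorgan", LEXICON.__contains__) finds the verb + noun split; restricting cuts to syllable boundaries would only change the range of cut positions tried.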

Another feature of INPUT is its handling of infinitive verbs used as
nouns. If a word is evaluated as an infinitive verb, a special routine is
called to add features and semantics for an infinitive used as a noun. Note
that in a normal text, both verb and noun would not be possible at the same
time, since nouns would be capitalized. The terminal used here for input,
however, had only upper case, and it seemed that any special conventions for
nouns would be a burden on the user. For this reason, nouns are not
distinguished from the rest of the German input, and the system just works a
little harder to pick them out.


2.5.5 An Alternative

The morphology component works well, with only one real hitch: words
ending in en take a considerable length of time, even with the special passes
made to cut combinatorics. This may be an indication that morphology and
syntax should not be a two-pass proposition, but rather that the state of the
parse should be used to limit possible morphological analysis in the next
word. Thus if we have "einem alten Mann" ("an old man", dative case), by the
time the einem has been parsed we have gathered enough information about alten
to limit it to two possibilities: either (ADJ WEAK DAT SING MASC) or (ADJ
WEAK DAT SING NEUT). Therefore, to solve combinatoric problems, we might
distinguish two levels of morphological features. The higher level
features could be assigned in a preprocessing pass. At
that point we might just specify that an adjective had been found, with a
permissible adjective ending (ADJ DECL). Later, in parsing the noun group, a
second morphological pass could check to see if the proposed adjective
exhibited the correct case, gender, number, and type with respect to the
determiners and other adjectives in the noun group. With this approach, the
combinatorics of adjective endings need only be tackled when it appears
absolutely necessary. This approach would not be difficult to implement, but
it would involve considerable effort to convert the existing system to use it.


2.6 The Operation of the Interpretive Grammar

The interpretive grammar operates on the string of feature
lists output by the morphology routines. It goes to work to construct a parse
tree, and at any given time the grammar will be following up only one parse.
If syntactic ambiguities lead the grammar astray, special backup routines are
called to find the difficulty and set the grammar onto another, hopefully more
successful, path. It does not seem profitable to discuss the interpretive
grammar in detail, since its behavior is so close to that of Winograd's
English grammar. A sample parse may be found in Appendix B, and readers
wishing further information on the approach to parsing should consult Winograd
(39, chap. 5) and, for details, Rubin (32). What do seem to be worth
discussing are the places where German presents special problems, or where
a new approach is taken, so this section will be devoted to an assortment
of these topics.


2.6.1 The Demotion of the Verb Group
As was noted previously, the German grammar has special programs for
clauses, noun groups, preposition groups, and adjective groups. There is no
verb group program, but there is a verb phrase routine instead. The demotion
of the verb group follows Hudson (28), who maintains that one criterion for a
group is that its components must be contiguous. In English, of course, verbs
do stick together a good part of the time, making it easier to make a case for
a verb group. In German, however, verbs are frequently separated. In fact,
whenever there are verbs other than the finite verb in a major clause, German
word order requires that these other verbs go to the end. Giving verbs a
phrase instead of a group status does not prevent the grammar from developing
the special relationships that occur between verbs in a sentence. It is the
clause program, however, instead of a verb group program, that is responsible
for establishing these relations.


2.6.2 Handling Partial Information
The original PROGRAMMAR comes equipped with three mechanisms for
recording information encountered in the course of a parse. First, there is
the construction of the parse tree itself. Second, there are routines to add
features to a parse node, and third, we can set and access variable-like
registers associated with a node. Actually, there is a fourth mechanism,
since the control structure itself is a way of holding onto distinctions
within a routine. None of these mechanisms, however, automatically
distinguishes between information that is in some cases fragmentary and
information on which further decisions can be based. By fragmentary or
partial information, I mean information about choices that have been narrowed,
but not fully decided. For example, the definite article dem in a noun group
is DATive, SINGular, and either MASCuline or NEUTer. It is useful to know
that FEMinine has been eliminated from the gender system as a possibility, but
this is only partial information, since at this point the parser still sees
dem as ambiguous. A parsing system should have ways to deal with this partial
information easily, both to designate the information and also to tell the
parser where the partial information is to be found.
Largely in response to the gender-number-case problem, two new

facilities were added to PROGRAMMAR: FIX and NONEX. FIX gives us a way of
handling partial information before a parse node is constructed. In the
English version of PROGRAMMAR, the only way to specify the features required
in a word is to use these as parameters of the actual function call of PARSE.
With FIX, we can eliminate possible parses of the next word as the relevant
information is encountered. For example, (FIX MASC SING) moves any feature
sets that are not both MASC and SING to the back of the list. Similarly, we
can say (FIX OR MASC NEUT), which exiles feature sets that are not either
MASC or NEUT. Feature sets that have been eliminated are put behind a marker
so that they are no longer accessible to the PARSE function, although it is
easy to recover old possibilities by erasing the marker. FIX may be used
several times, then, to narrow the possibilities before PARSE is finally
called. One facility that FIX could have but does not right now is FIX NOT, a
way to disqualify feature sets that contain the feature given.
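
The marker scheme just described can be pictured with a small sketch. This
is Python rather than the LISP of PROGRAMMAR, and it is only an illustration
under stated assumptions: the class name Candidates and the methods fix,
fix_or, live, and restore are hypothetical, and feature sets are modelled as
frozensets of feature names.

```python
class Candidates:
    """Candidate feature sets for the next word, plus a marker (hypothetical)."""

    def __init__(self, feature_sets):
        self.sets = [frozenset(fs) for fs in feature_sets]
        # Entries at index >= marker are "behind the marker": hidden from PARSE.
        self.marker = len(self.sets)

    def _narrow(self, keep):
        live = self.sets[:self.marker]
        kept = [fs for fs in live if keep(fs)]
        exiled = [fs for fs in live if not keep(fs)]
        # Exiled sets move behind the marker; earlier exiles stay where they are.
        self.sets = kept + exiled + self.sets[self.marker:]
        self.marker = len(kept)

    def fix(self, *features):
        """Like (FIX MASC SING): keep only sets containing all the features."""
        self._narrow(lambda fs: set(features) <= fs)

    def fix_or(self, *features):
        """Like (FIX OR MASC NEUT): keep sets containing at least one feature."""
        self._narrow(lambda fs: bool(set(features) & fs))

    def live(self):
        """The feature sets still visible to a later PARSE call."""
        return self.sets[:self.marker]

    def restore(self):
        """Erase the marker so that every old possibility is visible again."""
        self.marker = len(self.sets)


# Hypothetical readings of some ambiguous word; not taken from the lexicon.
word = Candidates([
    {"DAT", "SING", "MASC"},
    {"DAT", "SING", "NEUT"},
    {"DAT", "SING", "FEM"},
])
word.fix_or("MASC", "NEUT")   # two readings remain visible
word.fix("MASC", "SING")      # one reading remains visible
```

Because exiled sets are only moved, never deleted, calling restore() brings
back all three readings, which corresponds to erasing the marker.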
One place where FIX is used frequently is in dealing with verb phrases.
Consider the example:

Eine Eledone, deren sämtliche Saugnäpfe entfernt worden sind,...

An eledone, whose suckers have all been removed,...

After parsing the past participle worden, we know that the next word must be a
form of sein and either an INFinitive or a FINite verb. We can use FIX to
record these facts, and then check to see if the verb is finite. (All but
first and third person plural of most verbs can be unambiguously distinguished
from the infinitive form.) Since it turns out that sind is a finite form, it
should agree with the subject, so we can call FIX again with the person and
number of Saugnäpfe: (FIX P3RD PLUR). Note that the addition of FIX does not
give PROGRAMMAR any new power, since we always could have done the same sorts
of things by setting variables to be used later in the call to PARSE. FIX
merely makes it easier to accumulate information about the next word to be
parsed, even if this information is found in widely scattered parts of the
clause.

NONEX, for non-exclusive parse, allows the PARSE routines to live with
ambiguity, at least to a limited extent. When we make a call like (PARSE ADJ
DAT SING NONEX), we are saying, "Eliminate feature sets that don't agree with
the features specified, but if more than one feature set is left (say, sets
with different genders), don't worry right now." In the grammar, NONEX parses
are used within noun groups, so that we are not forced to make case, number,
and gender distinctions on adjectives and determiners before the necessary
information is in. In the noun group, the ambiguity will only rarely persist
beyond the point where the main noun is parsed. Code is built into the noun
group specialist to allow us to go back and clear up NONEX parses (i.e. pick
the correct feature set) when it is possible to do so. Another place that the
noun group specialist uses NONEX is in parsing pronouns. We may not know the
gender or number of a pronoun until its referent has been found. (We may also
not know the case, but that is handled by other means; see the next
section). NONEX allows us to do the best we can with a pronoun, maybe using
subject-verb agreement to limit the possibilities. When the referent is
found, a process usually not done until the end of the sentence, we can clean
up the pronoun node.
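
A minimal sketch of the difference between an ordinary parse and a NONEX
parse follows. It is Python, not the original LISP, and it rests on an
assumption that the text does not spell out: that an ordinary (exclusive)
parse commits to the first surviving feature set. The function names
parse_word and clear_up, and the sample readings, are hypothetical.

```python
def parse_word(feature_sets, required, nonex=False):
    """Return the surviving feature sets for a word, or None on failure.

    Assumption: an exclusive parse commits to the first surviving set,
    while a NONEX parse keeps every survivor on the node.
    """
    survivors = [fs for fs in feature_sets if set(required) <= set(fs)]
    if not survivors:
        return None
    return survivors if nonex else survivors[:1]


def clear_up(node_sets, known):
    """Later cleanup: keep only the sets compatible with newly known features."""
    return [fs for fs in node_sets if set(known) <= set(fs)]


# Hypothetical readings of an adjective form; the real lexicon is not shown.
adj = [
    {"ADJ", "DAT", "SING", "MASC"},
    {"ADJ", "DAT", "SING", "NEUT"},
    {"ADJ", "ACC", "SING", "MASC"},
]
node = parse_word(adj, ["ADJ", "DAT", "SING"], nonex=True)  # two readings kept
node = clear_up(node, ["NEUT"])    # the main noun turned out to be neuter
```

The cleanup step stands in for the code in the noun group specialist that
goes back and picks the correct feature set once the main noun is known.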

FIX and NONEX, then, make it easier to handle partial information. They
do not, however, really come to grips with some of the deeper problems of

eet ate gpg tt te Ue RS, 
Eanes ae ‘ 


2.6.3 Objects of the Verb

While the main verb in English generally precedes its objects, in
German this is much less often the case. As was mentioned above, whenever
there is more than one verb in a major clause, all but the finite verb go to
the end. In addition, most secondary clauses are end-order, that is, all verbs
are at the end. Thus, we frequently find ourselves confronting objects of the
verb with no inkling of what the main verb is. To further complicate the
situation, the ordering of different objects may depend on whether one or both
of them are pronouns, and what sort of pronouns they are. The situation is
complex enough, I think, to force a factoring of the problem. To this end, I
have made two distinctions, one of which seems successful, while the other
seems less satisfying.

Let us first consider the more successful measure. Object parsing was
divided into two passes, the VERB-OBJECTS routine (a clause, rather than a
group, routine) and the WORD-ORDER-CHECK routine. VERB-OBJECTS finds noun
groups, preposition groups, etc., and then WORD-ORDER-CHECK makes sure that
they are in the correct order. This division of labor had nothing to do with
efficiency or a vision of "what people do." It was merely an attempt to avoid
a rat's nest of complex programming. Ideally, the search for objects and a
check on their word order should probably be done in parallel, but the only
disadvantage I can see in the way I have factored the problem is that it might
take a little longer to reject a bad parse on word order grounds.


A second aspect of the grammar's handling of objects is the design
decision that the case of a noun group is a higher level feature than its
gender or number. That is, as it stands now, the noun group specialist must
always be called with a case specified. My motivation for doing this was that
at a given point in the parse, noun group case is often predictable. This was
a bad decision, since if we make the cal! UPAR WE ASL) ‘ind Tf there Is a 
dative nodn ‘group ether “tian “ar edcdeetive’ehd, thd ‘noun’ group special ist 
WITT feet Ly never Rho Tie sitet “Pe afoot: THE a Ptah Ive Ie te pereTé NE to 
be called without case specification and to have it parse any noun group,
reporting back the case that it finds. This would be a relatively simple
change to make, although it could get into trouble with possible ambiguities
between nominative and accusative cases for neuter singular, feminine
singular, and plurals. As it stands now, we would need some extra mechanism
to Renee tHE, but! a etkp le ehange fo altod nt ‘tcl tication oF « parse 
(ive. “paree siytting tut a re Hatt ‘houuh roup") te iui’ be ‘wuffictent. 
Allowing the noun group specialist to be called without case
specification would improve the efficiency of VERB-OBJECTS and several other
routines, but it would not change the structure of VERB-OBJECTS greatly. As
it stands now, VERB-OBJECTS is called both when the main verb has been found
and when it has not been. The routine works from a "shopping list" found in the
register FILTER. If the main verb has been found, then FILTER is a list of
the sorts of transitivity types of objects it may take. (These were described
in section 2.3.13.) If no main verb has yet been found, the main clause
program can usually tell whether we are looking for active or passive object
types and set FILTER to these. In something like a secondary clause, FILTER
is set to all possibilities, of which there are currently 26. Note that this
is not as inefficient as it seems, since any attempt at parsing a set of
objects will give us information that can be used to update the possibilities
in FILTER. For example, FILTER is always ordered from longest to shortest,
with noun groups considered longer than prepositions (i.e. AA before AP
before A before P). Thus, if at any point we fail to find a noun group for a
second object, we can eliminate all double noun group types from FILTER. With
the exception of the problem with noun group case specification mentioned
above, parsing of verb objects proceeds in an orderly and fairly efficient
fashion, even when the main verb has not been found.
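
The ordering and pruning of FILTER can be sketched as follows. This is a
Python illustration, not the thesis code; the type codes are hypothetical
(each code is a string with one letter per object, "N" for a noun group and
"P" for a preposition group), and the function names are invented for the
example.

```python
def order_filter(types):
    """Order object types longest first; at equal length, a type with more
    noun groups ("N") counts as longer than one with preposition groups."""
    return sorted(types, key=lambda t: (len(t), t.count("N")), reverse=True)


def drop_double_noun_groups(filter_list):
    """Called when no noun group can be found for a second object."""
    return [t for t in filter_list if t.count("N") < 2]


# Hypothetical type codes; the system itself has 26 possibilities.
FILTER = order_filter(["P", "NP", "N", "NN"])
FILTER_AFTER_FAILURE = drop_double_noun_groups(FILTER)
```

Keeping the list ordered means that a single failed attempt can rule out a
whole block of longer types at once, which is why starting from all
possibilities is less wasteful than it first appears.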


2.6.4 Limiting the Parse
When I started this project, I naively thought that parsing German would
be a simpler matter than parsing English. The reason for this belief was the
very thing that gave the morphology component such a headache: the abundance
of case, gender, number, and person distinctions. Natural languages
are very finely balanced. The German syntactic components are carefully
tagged because word order has a much wider degree of flexibility. I already
knew that objects of the verb have more freedom in German than in English;
"Den Mann kenne ich" is perfectly fine ("I know the man," literally, "The man
know I"), even "Dem Mann gab ich es" is not surprising ("I gave it to the
man," literally, "To the man gave I it"). What gave the parser the most
trouble as it was being developed were things like the possibility of
post-fixed, as well as pre-fixed, preposition groups, adjectives that take
preceding noun groups as objects, clauses appearing prenominally, post-fixed
genitives that are marked by case but have no separate marker as in English,
etc., etc. In this section, I would like to run through a list of the methods
used to keep the parser on the track. Some of these measures were more
desperate than others, but I have attempted to handle them in a uniform
manner. Note that one other very important mechanism for limiting the parse
is semantics, which is discussed in the next three chapters.
The first set of methods for limiting the parse comes built into
PROGRAMMAR. It is easy to move around the parse tree and to interrogate the
next word in the sentence about its features. The CUT variable can be set to
prevent parsing from going past a certain point in the sentence. Finally,
message variables can be set, so that reasons for a failure can be recorded.
One way message variables were used throughout the system was to prevent a
second parse attempt when the first one had already failed. For example, if
we call (PARSE NG ACC) and an accusative noun group has already been
unsuccessfully attempted at this point, the noun group routine returns failure
without any reparsing. Although there is some overhead in setting up a node
(here the noun group node) when a PARSE call is made, checks at every calling
point would be cumbersome. Therefore, the individual syntactic specialists
are made responsible for checking the failure list.
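
The failure list can be pictured with a short sketch. It is Python rather
than LISP, and every name in it (Sentence, failures, attempts, parse_ng, and
the try_parse callback) is hypothetical; the point is only that the
specialist itself, not its caller, consults the record of earlier failures.

```python
class Sentence:
    def __init__(self, words):
        self.words = words
        self.failures = set()   # (position, unit, case) already tried in vain
        self.attempts = 0       # counts real parse attempts, for illustration


def parse_ng(sentence, position, case, try_parse):
    """Noun group specialist: consult the failure list before doing any work."""
    key = (position, "NG", case)
    if key in sentence.failures:
        return None             # fail at once, without any reparsing
    sentence.attempts += 1
    node = try_parse(sentence.words, position, case)
    if node is None:
        sentence.failures.add(key)
    return node


def always_fail(words, position, case):
    """Stand-in for the real noun group parse; it never finds anything."""
    return None


s = Sentence(["es"])
parse_ng(s, 0, "ACC", always_fail)   # a real attempt, recorded as a failure
parse_ng(s, 0, "ACC", always_fail)   # answered from the failure list
```

After both calls only one real attempt has been made; a request for a
different case at the same position would still be tried.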
In addition to the use of message variables, the programs do some
look-ahead in the sentence. I am not sure whether to be happy with this
approach or not. There is nothing inherently wrong with look-ahead in text
processing, but I suspect that it should be more fully integrated into the
design of the parser (see section 2.7.2). Whatever the case, the look-ahead
used was simple and reasonably effective. An example is the prenominal clause
like "die im Nervensystem verlaufenden Erregungen" ("the excitations that run
through the nervous system.") If there is no present or past participle
anywhere in the sentence, a call for a PRESP or PASTP clause will use
look-ahead and fail immediately. If this were not done, we would probably end
up parsing the main noun of a noun group as a verb object and go to a lot of
trouble before it became clear that no prenominal clause was present. Since
prenominal clauses can occur in just about any full noun group, this could
slow down the system considerably. I should note that the look-ahead
mechanism here could break down in very long and complex sentences, where a
lot of different syntactic structures are present. As it is now, if there is
a member of the word class anywhere in the sentence - even twenty words away -
the look-ahead will be satisfied and a parse will be attempted. This
rudimentary look-ahead, then, is not a panacea, but it does allow the parser
to take simple actions for simple sentences.
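
The look-ahead test amounts to a scan of the remaining words for any member
of the required word class. The sketch below is Python and purely
illustrative: the function name, the dictionary-based lexicon, and the word
class labels attached to each word are assumptions made for the example.

```python
def may_try_participial_clause(words, position, lexicon, wanted="PRESP"):
    """Crude look-ahead: could any remaining word belong to the wanted class?"""
    return any(wanted in lexicon.get(w, ()) for w in words[position:])


# Hypothetical lexicon entries: word -> set of possible word classes.
lexicon = {
    "die": {"DET"},
    "im": {"PREP"},
    "Nervensystem": {"NOUN"},
    "verlaufenden": {"PRESP", "ADJ"},
    "Erregungen": {"NOUN"},
}
words = "die im Nervensystem verlaufenden Erregungen".split()
ok_at_start = may_try_participial_clause(words, 0, lexicon)
ok_for_pastp = may_try_participial_clause(words, 0, lexicon, wanted="PASTP")
```

As the text notes, the scan has no distance limit, so a participle far away
in a long sentence satisfies it just as well as one in the next noun group.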

Two other parser-limiting mechanisms are more conventional. First, at
the beginning of the more complicated routines, entry conditions were set up.
If the next word's possible parts of speech do not match those in the entry
condition, failure is immediate. A second measure is to distinguish three
levels of noun group: FULL, SIMPLE, and NO-RSQ. When PARSE is called for a
FULL noun group, it is free to try anything. SIMPLE noun groups exclude
rankshifted noun groups (RSNG - like "ich weiss, dass es wahr ist" / "I know
that it is true.") Finally, NO-RSQ noun groups cannot be RSNG and may not have
rankshifted qualifiers. Verb objects in major clauses are parsed with FULL,
while objects of prepositions, for example, are parsed with SIMPLE. NO-RSQ is
used primarily within prenominal clauses to prevent embedding. (Actually, we
might want to allow embedding to one level of PRESP and PASTP clauses. This
would be a simple change.) The noun group distinctions save much parser
effort, and I do not feel that they seriously curtail the generality of the
grammar. Multilevel embedding strains human comprehension, and it is fair to
treat any embedding of like elements beyond a certain level as a pathological
case.
Finahly, bt ath alae tate, thehe ie tbh, The routines used: in’ the 

not the beet sessitie-ene, lnterentnd repiibe Whe refered: to HIt! (197 for 

an ansiyeie of wnniins-neknone ania dareription at “aabticit beckup.” While 

the backup: restines: im the weemer neve wel etienalityiie the need arose, they 


do hana, sone 96: the: perm onanon: hath. bichagn cid halictainaneds 


i. Te. 


Spne ere Se: Ae SaRRar 120 Ime Saal bie iabiasebonn tide ns 
mistakenly parsed as nouns - a problem that would not arise if the system
distinguished between upper and lower case letters (since nouns are
capitalized in German). TRANS-BACKUP is called if the main verb is parsed
after its objects and if the transitivity possibilities of the verb do not
match the objects found. This routine pops nodes off the tree and sets the
naceuairy regietens for another catt to the WMRGRUENTS outing, 


jjedte-oF 9. selhi-cimie heave ‘teen 
perecdaut if thene-is ae par tiolplerer gu plusciitiitive te te found: ‘The 
routine checks: to ans ahethenthe vark une qnilicted My Ene ot the objects of 
the verb. Lf-am, dt pepe thaobpecte, este cet petit et the verb, and! 
returns: contrel te the CLAUSE routing fer ameter ty?’ The teet beckup 
uhich the finiteuges te thenainovert. Shige the edjict follois the verb in 
_— dnverted-onder, there ip setttlng. te: prevent. * pap jyoet from aovorbing 

. prepped tion groupe and clbjedte. oe } ‘He Stan” he sane ie true 
of secondary clauses, but this is handled by TRANS-BACKUP.) Semantic checking
helps some, but often it is not until the parser actually tries to parse
objects that it discovers that something is wrong. As the system stands now,
a subject noun group will not mistakenly appropriate a preposition group (for
an explanation of this, see the next section), but it could swallow up a verb
object, thinking the object was a genitive. It is the job of SUBJ-BACKUP to
intercede if the main verb comes up short of objects, and to check to see
whether the subject is holding on to more than it should. Object problems
with inverted order clauses whose finite verb is not the main verb are the
same as for secondary clauses and are handled by TRANS-BACKUP.




2.7 Problems and Observations
2.7.1 A Sticky Problem and a Partial Solution


One difficulty with using backup for natural language parsing is that, at
any given point, it is not always clear whether backup should be initiated.
In this section I would like to discuss a problem which occurs in English, but
which appears in much more florid form in German. The solution proposed does
depend on backup, but it attempts to minimize the instances where incorrect
parses will remain undetected.

Consider a secondary clause from our example paragraph:

Da aber manche in der Tiefsee lebenden Tintenfische in verschiedenen
bunten Farben erstrahlende Leuchtorgane besitzen, ...

But since many deepsea dwelling cephalopods possess light organs that
shine in various bright colors ...

The structure of this secondary clause is binder-interjection-subject-direct
object-main verb, and the correct division between subject and direct object
is after Tintenfische. A program parsing along blindly, however, could
appropriate the preposition group "in verschiedenen bunten Farben" as a
qualifier for the subject "Tintenfische." Syntactically, the parser would never
be the wiser, since "erstrahlende Leuchtorgane" is a perfectly good accusative
noun group. Semantically, too, it would take a fairly sophisticated program
to know that something was amiss.

In an attempt to avoid this sort of situation, the noun group specialist
‘adjective groupe ten ABC POSTION: ‘that ab sit -nefeng to it, it wilt refrain 
from doing so. Such preposition groups, etc. will then often end up bound to
the major or subordinate clause node. This is not a bad fate, because it is
often the correct one. Since academic German makes such frequent use of the
prenominal clause, it is less likely that simple relations will be expressed
with postnominal qualifiers. If, on the other hand, this is the wrong
decision, we are in a better position to find that out. Since major or
subordinate cheese: Wabi to: ta there: fet bgp nei tae tnan thal ‘component noun 
groups, even a relatively weak semantic component might be able to determine
that a clause has something extra, even though it might fail to do so for a
noun group.
Note that in the example above, assigning "in verschiedenen bunten
Farben" to the subordinate clause would be incorrect, since it really belongs
to the prenominal clause modifying the direct object. Since the semantic
specialist for the clause, SCLAUSE, is not lax, it will presumably decide that
this preposition group cannot modify the relation represented by the verb
besitzen and transfer control to a backup routine. Because of time
limitations, this backup code has not been written, but its basic task would
be to detect jurisdictional disputes. We already know that the preposition
group does not modify the clause, but we have to check to see whether the
components on either side want to claim it. With some simple syntactic
information - the entry conditions for different structures and the ways they
may terminate - we could eliminate certain claims on the preposition group.
In the example sentence a simple syntactic check is not enough to decide
whether the subject or the direct object should get the preposition; so we
would have to depend on the fuller syntactic check of a parse attempt, with
the accompanying semantic checking. The backup code could call for the two
different parsings, which in effect would allow the semantics programs to
direct the parse. This scheme is similar to one outlined by Woods for English
(48); the difference is the heuristic that assumes that modifiers belong to
the dominant clause unless proven otherwise.


2.7.2 Other Approaches to the Problem
When I began writing the interpretive grammar, the question I was asking
was how to adapt Winograd's approach to handling German. SHRDLU is not the
only way, however, to translate information like that in systemic grammar
networks into a parser. In the time since Winograd's system appeared, there
has been some interesting work on English parsing, notably that of Martin (246)
and of Marcus (23). Martin's approach takes the form of a parser similar in
spirit to Woods' transition networks, but which always expands all possible
parses instead of attempting to choose the most likely one and backing up.
Martin contends that semantics will limit the number of possibilities at any
given point, so that combinatoric explosion is not an issue. Marcus proposes
a "wait and see" approach, with decisions delayed until the necessary
information is in. The ability to delay parsing decisions would be
particularly useful for German; it will therefore be interesting to see the
results of this work. The merit of these approaches is that they avoid backup
with its accompanying problems of programming complexity and the possibility
of overlooking ambiguous cases.
Chapter 3 -- Ordering Concept Markers
3.1 The Conceptual Structure
Considering language as a "structure organized to convey meaning," we
have discussed the structure part to some extent and now will turn to the
question of meaning. By the meaning of a word, I mean the group of entries in
the conceptual structure associated with that word. But this, of course,
means that we now need definitions for "entries" and "conceptual structure."
Another word for entries here is concepts, and it is an open question just how
concepts should be embodied. Should we use rules, procedures, images, some
combination of these, or something else altogether? Should the organization
of the conceptual structure be a net with no constraints on linkage, or a
highly structured hierarchy, or, again, something else? Since no component to
do understanding has been implemented here, my answers to these questions will
have just enough specification to motivate other choices that must be made in
the system.

For the proposed deductive component, meaning can be defined in terms of
certain symbols, data structures, and programs. Entries in the conceptual
structure are theorems and assertions in a deductive programming language like
Planner (18) or Conniver (26). For the most part, these are either directly
or indirectly associated with entities called concept markers. Within the
system, the concept markers are primitives; assertions in the deductive data
base are built from them and, as mentioned, theorems are associated with them.
It is easy to imagine another level in the system with, say, visual images.
The concept markers would then point to these images, and procedures would be
available to do operations on them like inspection, updating, simple
manipulations like rotation, etc.
The chapter that follows is concerned with the way concept markers should
be ordered, and chapter 4 goes into more detail on the kind of processing we
would expect from a deductive component. The ideas discussed here are not
original, but I think this chapter and chapter 4 will be helpful in
| ‘bing soue-od- the shenuns: thet: ngiie At be @ ether 
the ‘syetencare shar do interact sugges Vith @ deductive components 


3.2 Objects, Relations, and Properties
As in Winograd's system, the things in the world will be represented as
objects, relations, and properties. Note that these three categories will be
used from now on only as distinctions within the conceptual structure, not to
name things in the real world. It simplifies things to consider these
three categories as primitive, since it is difficult to come up with
water-tight definitions. As an attempt, however, let us say that the process
of creating a conceptual object involves differentiating the part of the world
to which it corresponds from the rest of experience (with experience and world
viewed broadly - not just reality, but imaginary worlds, etc. as well).

An object, then, qoute to2ters- tame: Ue hapten feet hy Mepresented anything 
homed: etat leat tay sthat le, -any BheMmNaReenetteneS We an entity. Ari object 

might Ree physical:gh ject 4 HeeWTRUCK, ian -fibti peeiowsi.iiew ACETY-HALL; 

- mental sphenqmerse 4 Hew 6:GORBAN; some ienetr gt entity ike #TAUTH, etc. The 
sharp sign here is used to distinguish concept markers, which are part of the
system's internal conceptual structure, from words, which are part of
language.
A relation binds one or more entities, the exact type of linkage
depending, of course, on the type of relation involved. The entities bound
may themselves be relations, may be objects or properties, or may be some
combination of the three. One common type of relation is an event; events
always involve a specific point or span of time in addition to their other
participants. Properties are relations with one argument. This is the only
way that properties distinguish themselves from other relations, but the
distinction seems to be a useful one both conceptually and for purposes of
implementation. Properties are used to describe and modify objects and
relations, as well as other properties. To represent this, the system has a
special relation (called, not surprisingly, #HAVE-PROPERTY), which links
properties with the concept modified.

Our three conceptual categories correspond roughly to the jobs of
different syntactic structures: noun groups often refer to objects; clauses,
preposition groups, and some adjective groups represent relations; and most
complement and attribute adjective groups represent properties. This
correspondence should not be taken to imply that objects, relations and
properties are derivable from syntactic categories. The conceptual structure
is an independent level; in fact it is not surprising to see high level
semantic distinctions reflected in the syntax of at least English and German,
and probably all languages.


3.3 Selection Restrictions 
Considering the huge classes formed by classifying the world into
objects, properties and relations, it seems likely that particular elements
will be more useful and accessible if the classes are structured in some way.
Since any choice of structure should be influenced by function, let us look at
the way the system will want to use information about the world. Basically,
there are two kinds of activities that we want happening in the system. One
is the use of semantic selection restrictions to eliminate incorrect
interpretations. The other is the use of the full range of real-world
information to make a final choice between interpretations, including
determining pronoun reference, etc. These two functions make different
demands in terms of the kinds of knowledge structure that each can use most
easily, so a decision on structure requires a closer look at semantic
restrictions and deductive interpretation.
Selection restrictions indicate the outer limit on what types of
concepts may appear in a relation together. For example,

Das Stück Seife ist liebevoll.

The bar of soap is affectionate.

is an odd sentence because affection is an attribute of humans, maybe of
animals too, but it certainly cannot be an attribute of a non-living thing.
(This ignores, of course, personification, but this usage, as well as any
sort of metaphorical speech, will not be considered now. See section 5.8 for
a discussion of some of the issues involved.) From this example, we want the
selection restriction associated with affectionate to be #LIVING-THING. Such
a selection restriction gives us a criterion for rejecting bad parses. If the
semantic component ever finds itself trying to link affection to soap in a
straight scientific text, it will find no possible meanings of liebevoll that
will satisfy the selection restriction. Semantics will then fail, which will
cause the syntactic component to try another parse.

In addition to rejecting a parse because no meanings of a word are
acceptable, the semantic component could also use selection restrictions to
eliminate possible meanings of a word. This does actually happen, but it is a
rather tricky business. For example, we might expect that selection
restrictions could help us out in distinguishing between:
(i) Der Film spielt heute Abend.
    The film is playing tonight.
(ii) Unter Lichtreiz spielen die Chromatophoren.
    The chromatophores play when stimulated by light.

For the meaning of the verb spielen in (i), we might specify that the first
argument of the relation should be something like #THEATRICAL-PRESENTATION,
i.e. a film, play, puppet show, cabaret, etc. It would not be unreasonable to
expect the semantic markers for these phenomena to be classified under
#THEATRICAL-PRESENTATION. For (ii), however, it is hard to say what the first
argument might be. The lights on a marquee can play in this sense, the
Northern Lights can, chromatophores, of course, do - even sounds can. The
unifying characteristic here seems to be that these things form a system,
whose individual members perform their particular activity (flash, emit a
sound, move, etc.) in an apparently (to the perceiver) random order. This is
obviously not a simple characteristic, and probably the best we can do for a
selection restriction here is something very general, like #CONCRETE. Since a
thing that is a #THEATRICAL-PRESENTATION would also be classified under
#CONCRETE, if we are given sentence (i), selection restrictions alone will not
be able to tell us whether the film is running at the theater, or whether we
should expect to see it flashing on and off. For this reason, although the
selection restrictions in the system do resemble the semantic markers of Fodor
and Katz (6), they serve a different purpose. In this system, the selection
restrictions are not expected to give a full account, or even the major part,
of the meaning of a word. Something as simple as a selection restriction will
not be able to represent semantic constraints as to which participants may
take part in which relations with any degree of accuracy. What a selection
restriction can offer is a negative sort of heuristic for eliminating
impossible interpretations. In this way they can be quite useful for giving
feedback to the parser, and sometimes, but not always, they can be useful in
eliminating impossible interpretations of a word.

More detail of how selection restrictions are used will be given in the
next chapter. Based on what has been discussed so far, we might predict that
selection restrictions can be made to work most efficiently with a strictly
hierarchical structure. A single tree has been used for selectional
restriction processing, with the relation between levels of the tree being the
general one of class membership. Note that a single classification scheme is
not the only choice for selection restrictions. Multiple trees are a
possibility, as in Winograd. Operations on a single tree, however, can be done
with a minimum of time and effort. Since the reason for using selection
restrictions is efficiency, it seems to be the best choice.
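
The single-tree check described above can be sketched in a few lines of
Python. This is an illustration only, not the system's LISP code: the tree
fragment, the sense names "be-shown" and "flicker", and the function names are
assumptions made for the example. It shows why the restriction acts as a
negative heuristic: it can discard a reading, but it cannot choose between
readings that both pass.

```python
# Hypothetical fragment of a selection restriction tree, stored child -> parent.
PARENT = {
    "#OBJECT": "#WORLD",
    "#CONCRETE": "#OBJECT",
    "#THEATRICAL-PRESENTATION": "#CONCRETE",
    "#FILM": "#THEATRICAL-PRESENTATION",
    "#CHROMATOPHORE": "#CONCRETE",
}


def is_under(marker, restriction):
    """True if `marker` equals `restriction` or lies below it in the tree."""
    node = marker
    while node is not None:
        if node == restriction:
            return True
        node = PARENT.get(node)  # None once we step past the root
    return False


def surviving_senses(senses, argument_marker):
    """Drop every sense whose restriction the argument fails to satisfy."""
    return [name for name, restriction in senses
            if is_under(argument_marker, restriction)]


# Two assumed senses of "spielen", each with a restriction on its first argument.
SPIELEN = [
    ("be-shown", "#THEATRICAL-PRESENTATION"),
    ("flicker", "#CONCRETE"),
]

print(surviving_senses(SPIELEN, "#CHROMATOPHORE"))  # only the "flicker" sense
print(surviving_senses(SPIELEN, "#FILM"))           # both senses survive
```

For the chromatophores the restriction removes one reading; for the film both
readings remain, which is the limitation discussed for sentence (i).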

The selection restriction tree should be designed to eliminate the
largest number of possibilities as often as possible, but beyond this goal
its organization is a question of balance and a matter of taste. Given a
constant number of concepts, less depth means a faster search; more depth
means that the selection restrictions can make finer distinctions.


The features used here are organized by the hierarchical structure. Thus, our
tree might have as top node a marker #WORLD, whose three descendants are
#OBJECT, #RELATION, and #PROPERTY. The upper nodes of the relation subtree
are shown in Figure 3.1. Here #STATIC-RELATION covers things like spatial
relations and relations of possession, and #MENTAL-PROCESS covers
intellectual, emotional, and perceptual processes. This organization is taken
from Halliday. Figure 3.2 shows the upper nodes of the property subtree.


                              #RELATION

           #EVENT        #MENTAL-PROCESS        #STATIC-RELATION

                              Figure 3.1


                              #PROPERTY

      #MANNER-PROP    #SPATIAL-PROP    #...-PROP    #PERCEPTUAL
               #QUANTIFICATION    #IDENTITY-INTEGRITY

                              Figure 3.2


The first three nodes in Figure 3.2 cover properties that answer the
questions "how", "where" and "when".
#QUANTIFICATION deals with extent, which is either number, for objects, or
intensity, for relations. #IDENTITY-INTEGRITY contains properties like
wholeness and uniqueness, while #PERCEPTUAL deals with any properties that can
be detected by the senses. Note that this property classification is not
exhaustive, but it does give an idea of the organization of the tree. A
larger section of the tree used in the system appears in Appendix C.


3.4 Structuring the World for the Deductive Component

Since the selection restrictions can provide only negative information,
we would also like a finer-grained check to reject the rest of the possible
interpretations. There should be some positive criteria, so that we can begin
to know that our choice makes sense. For this we need the full power of a
deductive system. The question we are working on is how the data base and
theorems of the deductive system should be organized.

[Figures 3.3 and 3.4: two alternative classification trees for objects; the
legible leaf nodes include #HORSE, #TIGER, #ANT, #FISH and #OCTOPUS.]

For a start, the classifications used by the selection restrictions look
useful. Note, however, that in one context Figure 3.3 might be a useful
classification scheme for objects, while another context might favor
Figure 3.4. Even given the same general context, i.e. biology, psychology,
chemistry, or the supermarket, many different classificatory schemes are
possible. We want more than just a single classification scheme for the
general data base, so the structure should really be a group of trees.
Actually, since the trees are not completely disjoint, we can merge them into
a single lattice structure. Doing this to the two classifications above, we
would get the classification shown in Figure 3.5.
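
As a rough illustration of merging overlapping trees into one lattice, the
Python sketch below stores each tree as a child-to-parent mapping and lets a
node collect a set of parents. The two classifications used here (by habitat
and by skeleton) are invented for the example and are not a reconstruction of
Figures 3.3 to 3.5; the function names are likewise assumptions.

```python
def merge_classifications(*trees):
    """Merge child -> parent trees into a lattice: node -> set of parents."""
    lattice = {}
    for tree in trees:
        for child, parent in tree.items():
            lattice.setdefault(child, set()).add(parent)
            lattice.setdefault(parent, set())  # make sure roots appear too
    return lattice


def ancestors(lattice, node):
    """All nodes reachable by following parent links upward from `node`."""
    seen = set()
    stack = [node]
    while stack:
        for parent in lattice.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


# Two hypothetical classification schemes over the same objects.
BY_HABITAT = {
    "#OCTOPUS": "#WATER-ANIMAL", "#FISH": "#WATER-ANIMAL",
    "#HORSE": "#LAND-ANIMAL",
    "#WATER-ANIMAL": "#ANIMAL", "#LAND-ANIMAL": "#ANIMAL",
}
BY_SKELETON = {
    "#OCTOPUS": "#INVERTEBRATE", "#FISH": "#VERTEBRATE",
    "#HORSE": "#VERTEBRATE",
    "#VERTEBRATE": "#ANIMAL", "#INVERTEBRATE": "#ANIMAL",
}

LATTICE = merge_classifications(BY_HABITAT, BY_SKELETON)
print(sorted(ancestors(LATTICE, "#OCTOPUS")))
```

In the merged structure #OCTOPUS has two immediate parents, one from each
scheme, while the shared node #ANIMAL appears only once.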

What does the hierarchical structure of the deductive data base buy us?
First, the motivation for hierarchy is economy of definition. An #OCTOPUS is
an #ANIMAL which is a #LIVING-THING which is #CONCRETE which makes it an
#OBJECT. Without a hierarchical structure, we would be specifying, for
example, that octopuses have some mechanism of locomotion (property of being
an #ANIMAL), that they have some sort of reproductive system (property of
being a #LIVING-THING), that they are perceivable, although possibly aided by
instruments (#CONCRETE), and that they are entities rather than processes or
attributes (property of being an #OBJECT). All this would be associated with
#OCTOPUS, and much of the same would have to be duplicated for #HORSE,
#SEAGULL, and the rest. So from hierarchy we get economy of storage.

In thinking about the system, I have limited myself to a single context
and have not dealt with adding or deleting trees, or reorganizing conceptual
structures. These are interesting but major problems, related as they are to
problems like learning, hypothesizing, and switching from one context to
another. A working translation system would require a knowledge structure of
considerably greater flexibility than the one outlined here.


It should be mentioned in passing that neither hierarchical organization
nor word definitions themselves automatically imply difficulties with
borderline cases. Say, for instance, that the criterion for "living thing" is
self-reproduction. If viruses reproduce themselves but for other reasons we
are reluctant to call them living things, then this is a dilemma only if a
single defining property is all that we allow ourselves. In fact, it is
possible to set up a group of borderline categories defined by appropriate
mixtures of properties from the major categories.

The hierarchical order alone, however, needs supplementing, since we
would like to be able to explicitly represent other ordered relationships
between concepts. The additional structures to be introduced will lead us to a
general graph structure with no self-loops, but the presence of our underlying
hierarchical structure will constrain and control the finished result. Since
the lexicon is going to be interacting rather actively with the conceptual
structure, it might help to look at the situation for words before any
additions to the structure are made.


3.5 The Relation of Words to Concepts

There are several ways of approaching the problem of word definitions. A
word might be defined in terms of a class and some number of attributes. This
sort of formulation is an old one - "genus" and "differentia" are the
classical terms - and the standard form of a dictionary definition shows that
it is a familiar one: an octopus is "any of a group of mollusks having a soft,
saclike body, a large head with a mouth on the undersurface, and eight arms
covered with suckers." Here, the phrase up to "mollusks" represents the genus
and then the differentiae follow. As the term implies, the differentiae
offer criteria for distinguishing between different words defined with respect
to the same conceptual class. A definition like this could be used in a
learning (i.e. non-static) structure, and it presupposes a top-down learning
process. The genus is assumed known and the differentiae give distinguishing
characteristics for the new node that is added. (In actual practice, the
differentiae may be used to pick out a concept which is already known. In
this case both genus and differentia are known, and the problem is merely
learning a new word for an already familiar concept.)

This genus-differentia type of definition might have its place in a
system with learning capabilities, although it would not be a fundamental
place, since a simple top-down process can not adequately model a good part of
the learning that we see people doing. Since, however, I am not considering
issues of learning at all, definitions will take a different form. The
knowledge data base is assumed to be static in the sense that for the words
defined one or more nodes (i.e., concept markers) are always present in the
conceptual structure. Furthermore, the differentiae - those bundles of
information that distinguish a concept from other members of its class - are
also presumed to be in the data base already. Therefore, a word can be most
easily defined by reference to the concept node or nodes it represents, and so
words can, in a sense, be "plugged into" the conceptual structure. Since the
system already knows about the concepts represented by the restricted number
of words it encounters, no conceptual restructuring is going to be necessary.

Since words are defined by reference to concept nodes, it follows that words
are ordered with respect to the conceptual structure. Note, however, that not
all conceptual distinctions need be reflected lexically; that is, some
concepts may have no word associated with them. In the case of sensory
impressions, for example, language vocabulary does not begin to approximate
the number of different shades of color that can be distinguished and
remembered.


3.6 Fields

With words in place, we can return to the ordering of concepts. The
conceptual structure displays subordination and difference between members of
a single class (or members of different classes), but a phenomenon like word
contrasts cannot yet be represented. One alternative is to order the
descendants of a concept node. But if more than one criterion is
relevant, bookkeeping could become annoying. And of course, this does not
even address the similar problem of ordering nodes from different classes.
The solution proposed is the introduction of a linguistic field. The term
"linguistic field" is not new; for a discussion of its history, see Robins.
On the basis of what has been developed so far, the field should
be an ordering of object, relation, or property concepts using a property as a
criterion. We do not want to think of a field as ordering words, since the
ordering here is semantic and thus strictly speaking should not deal with the
words themselves.

To represent word contrasts, we could use + and - at the extreme ends of
a field. One special sort of two-member field represents the case where the
elements exhaust the domain of the field's property, for example the case of
negatives. (There need not be any middle ground between "big" and "not big",
where the criterion of size applies.) Antonyms present an interesting
situation. Take, for example, "good" and "bad". (Here, and in the next few
sections, I will use English examples, since the issues discussed seem to be
independent of the language involved.) Most people would call "good" and "bad"
opposites, and so their associated concept markers are allotted a two-member
field along a property like #WORTH. But what happens when we expand the
context to include "great", "excellent", and "lousy"? With respect to these,
"good" and "bad" undergo a subtle shift in meaning and are no longer the
absolutes for which we constructed the two-member field. Such word pairs can
be called polysystemic. We therefore need one or more additional fields for
the different frames of reference, and the process of interpreting a word may
entail deciding which field is relevant for the particular usage. The
implementation details for fields seem to depend on the way knowledge will be
structured in the system, so fields will not be considered further here.
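
Although the implementation of fields is left open here, the idea can be made
concrete with a small Python sketch. Everything in it is an assumption made
for illustration: the `Field` class, the marker names #BAD, #GOOD and
#EXCELLENT, and the choice to store a field as a tuple ordered from its - end
to its + end. It shows a polysystemic marker sitting at a pole of one field
and in the middle of another.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Field:
    criterion: str   # property used as the ordering criterion, e.g. #WORTH
    members: tuple   # concept markers ordered from the - end to the + end

    def pole(self, marker):
        """Return '-', '+', or 'middle' for a member of this field."""
        index = self.members.index(marker)
        if index == 0:
            return "-"
        if index == len(self.members) - 1:
            return "+"
        return "middle"


# A two-member field of absolutes and a graded field over the same property.
TWO_MEMBER = Field("#WORTH", ("#BAD", "#GOOD"))
GRADED = Field("#WORTH", ("#BAD", "#GOOD", "#EXCELLENT"))
FIELDS = [TWO_MEMBER, GRADED]


def fields_for(marker):
    """Every field a concept marker takes part in."""
    return [field for field in FIELDS if marker in field.members]


for field in fields_for("#GOOD"):
    print(len(field.members), field.pole("#GOOD"))
```

Interpreting a particular use of "good" then amounts to picking one of the
fields returned by `fields_for`, which is the decision the text describes.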


3.7 Synonymity and Connotation

In terms of the conceptual structure we have been describing, synonymity
can be defined as the relation that holds between words that share the same
associated concept marker. Given a set of synonyms, the present structure can
therefore represent their similarity; the next step is to look into ways to
represent their differences. The first question to ask is whether there exist
any pairs of words that are interchangeable in every case. Consider, for
example, the words feline and cat. That these are synonyms is exhibited by
the fact that a person would be willing to use either to denote the standard
furry animal with four legs and whiskers. Feline, however, is a more formal
word and one whose usage would tend to be restricted to scientific or poetic
situations; cat is the general purpose word. The distinction here is the
familiar one between connotation and denotation. In general, one would want to
call denotation the conceptual meaning of a word - what I have been talking
about so far - and connotation the information associated with the use of the
word itself. Connotation tells us about the frame of reference in which a word
is being used and gives information about the situation, for instance whether
it is formal or informal. Language exhibits such economy in other areas (see
Frey (7)) that it would be surprising to find two elements at the lexical
level completely equivalent. We have a pair of words - cat and feline - that
seem virtually interchangeable with respect to denotation. The difference
comes, of course, at the level of connotation, and it seems that this will be
the case for all synonyms.

To get this new connotation information into the model, we introduce
another sort of field. This field takes synonymous words and orders them
along properties like formality, technicality, etc. The definitions in the
system use only binary properties, i.e., ± formal, ± technical, etc. In
another system that deals with a wider context and has a richer vocabulary, I
suspect that one would want greater expressive freedom than is given by binary
categories.
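
A minimal Python sketch of such a connotation field follows. It assumes a toy
lexicon in which synonyms share one concept marker and differ only in binary
connotation features; the dictionary layout, the feature values given to
"cat" and "feline", and the function names are illustrative assumptions, not
the system's definitions.

```python
# Toy lexicon: synonyms share a concept marker and differ in connotation.
LEXICON = {
    "cat":    {"concept": "#CAT",
               "connotations": {"formal": False, "technical": False}},
    "feline": {"concept": "#CAT",
               "connotations": {"formal": True, "technical": True}},
}


def synonyms(concept):
    """Words whose definitions point at the same concept marker."""
    return sorted(word for word, entry in LEXICON.items()
                  if entry["concept"] == concept)


def choose_word(concept, **wanted):
    """First synonym whose binary connotations match every requested value."""
    for word in synonyms(concept):
        connotations = LEXICON[word]["connotations"]
        if all(connotations.get(name) == value
               for name, value in wanted.items()):
            return word
    return None


print(synonyms("#CAT"))
print(choose_word("#CAT", formal=True))
print(choose_word("#CAT", formal=False))
```

With only binary features a request such as `formal=True, technical=False`
finds no word at all, which hints at why a richer vocabulary would call for
more than binary categories.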


3.8 Choosing Concept Markers

In the chapter so far, we have discussed the way concept markers should
be ordered, but there has been no indication of how they should be chosen. In
fact, it is not at all clear what a set of semantic markers should look like,
because they are used in the system by more than one component. Structures
built from concept markers are of central importance in the system, so
although the following discussion gets ahead of the exposition, I think it is
important to stop and consider some of the issues involved in choosing concept
markers.

For each sentence, the semantic component will construct a semantic
representation. This will be built from concept markers supplied by word
definitions and the semantic specialist routines themselves. After a semantic
representation has been constructed for a sentence, its component concept
markers will be used to call deductive routines associated with them. It is
the job of the deductive component to pick the most likely representation and
ship it over to the generator. The generator in turn will use the semantic
representation to produce English, and for this, the English dictionary is
ordered so that words are associated with markers or groups of semantic
markers. The concept markers, then, have a fourfold role: in source
language definitions, in target language definitions, in the semantic
representation, and in the deductive data base. (A closer look at a fifth
role, that of selection restrictions, is deferred until the next section.)
Given the various roles played by the semantic markers, let us consider
the choice of a dictionary definition for the German verb brechen ("to
break"), as used in the example:

    Fritz brach das Fenster. / Fritz broke the window.

First, we could use a special semantic marker #BREAK with a standard
definition procedure for relations (see section 4.2.1). This definition will
be compact, easy to write, and easy for a person to read. On the other hand,
the marker #BREAK is a rather high level one, and we might choose instead to
write the definition as a special function that builds a chunk of lower level
representation such as (#CAUSE X (#BECOME Y #BROKEN)). Language dependence is
an obvious worry in general, and particularly so when we are dealing with two
languages as in our system.


expan tet: tr acitet te gunerete® teder Tit ithe fharthdltiel 10 Xd “the cies i ve 
component! (emcapt! Tit cbrtatt: spit! at: comin inter iit Uéafédueeed?. The 

hae tid olipre: (tos Teve) 
i mii en 

One possible solution might be to use this low level representation only.
The deductive routines would then not be choosing between levels of
representation, but only deciding which low level representation is to be
passed on to the generator. This low-level representation would try to
approximate language independence, so that it would be adequate as input for
generation into other languages as well. Dictionary writing could be simple
and uniform, and at the same time no information would be lost on the way to
the generator. I did not choose it for two reasons. First, a low level
representation means more complex target language definitions. With a lower
level semantic representation, a large number of target language definitions
would involve associating pieces of the semantic representation, instead of
just a single marker, with words. This is a pattern matching problem that,
given a large number of words, could slow down the generation process
intolerably (at least given current memory architecture). In addition, note
that the (#CAUSE X (#BECOME Y #BROKEN)) representation offers no clear
advantages for organizing the deductive data base. The kernel of meaning
associated with #CAUSE may not vary from one situation to another, but, as
the participants change, the nature of the causal relation may, as well. For
example, compare the #CAUSE in "Sims broke the mirror" with the causality
involved in "Erwin popped the popcorn." The actions associated with causing
breakage are hitting, dropping, etc. To cause corn to pop, on the other hand,
involves putting a pan of it over some heat source. What I am trying to get at
here is that with a low-level representation, information used in deduction
will still have to be associated with combinations of concepts, i.e. higher
levels.

Because of the considerations mentioned, in the system semantic markers
are chosen as the union of different word boundaries. If something can have
two different lexical representations in German or in English (exclusive of
connotation differences, that is), the lower level representation is chosen.
For example, kennen and wissen ("to know") wind up with the two semantic
markers #KNOW-A-FACT and #KNOW-A-PERSON, while gale and wind (in German,
Wind) get the two markers #GALE and #WIND. With this sort of organization,
the deductive component is given basically a selective role, and only adds to
the semantic representation it is given by filling in certain slots left open
by the semantic specialist routines. The semantic representations generated
from these semantic markers have no claim at all to language independence, but
the loss in generality is compensated for by faster generation.
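
One way to picture "the union of different word boundaries" is the Python
sketch below. It assumes a hand-made list of usages, each tagged with the
German and the English word that would be used for it; two usages share a
marker only when both languages use the same word for both. The usage
descriptions and the function name are assumptions made for the illustration.

```python
def choose_markers(usages):
    """Group usages so that a marker is split wherever either language
    draws a lexical boundary. Returns {(german, english): [usages]}."""
    groups = {}
    for usage, german, english in usages:
        groups.setdefault((german, english), []).append(usage)
    return groups


# Hypothetical usages with the word each language would choose.
USAGES = [
    ("know that it is raining", "wissen", "know"),
    ("know Fritz personally", "kennen", "know"),
    ("a violent storm wind", "Wind", "gale"),
    ("a light wind", "Wind", "wind"),
    ("break a window", "brechen", "break"),
    ("break a branch", "brechen", "break"),
]

MARKERS = choose_markers(USAGES)
for (german, english), members in MARKERS.items():
    print(german, english, members)
```

English alone would give "know" a single marker and German alone would give
"Wind" a single marker; taking the union of both sets of boundaries yields
separate markers in each case, while the two "break" usages stay together.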




3.9 When to Use Restrictions

When using concept markers as selection restrictions, it is important to
remember that they give only partial information. They are used in a system
where restrictions are eventually redundant. The justification for this
redundancy is efficiency, as discussed earlier in the chapter. Another
attraction of the concept marker restrictions is that they are an easy way to
get a handle on semantics - in fact one of the few ways that we know, without
getting entangled in a large and complex model of world knowledge. Selection
restrictions, then, look like a promising way to use semantics to guide the
parse.

The first version of my system did use selection restriction information
to guide the parse, although only to a very limited extent. This was in the
routine that parses verb objects. When the verb was found, its restrictions
were checked to see if they included a location, manner, or some other
property. If this was the case, prepositional phrases and adverbs were
checked to make sure that at least one of these fit the semantic requirement.
If this were not the case, then special action could be taken, like checking
other syntactic components to see if they had absorbed a preposition by
mistake. This was, of course, only limited use of semantics to guide the
parse, but it was an area where a bit of help was obviously needed, namely,
binding of adverbial modifiers.


In the most recent version of the system, the parser does not use
selection restrictions in this way. This decision was primarily motivated by
German word order. As was mentioned in chapter 2, for English clause
structures, the main verb precedes its objects. We can therefore access the
verb's restrictions and use them to look for or evaluate objects. In German,
however, only some main clauses have a subject-verb-objects word order, while
secondary clauses are usually ordered subject-objects-verb. In addition, any
main clause with an auxiliary, modal, or passive verb structure will also have
its main verb at the end. Since using selection restrictions to guide the
parse requires a certain amount of apparatus (to handle multiple definitions,
optional objects, and variation in word order), the investment in programming
effort seemed to promise less return for German than it does for English.
Selection restrictions are, of course, still part of the system, but they are
used exclusively by the semantic component to eliminate impossible semantic
representations.

In the current implementation, semantic restrictions are hung on the
concept markers as LISP properties. Since the markers have an explicit tree
ordering, the restrictions need not be associated with each semantic marker,
but may instead be tacked onto the highest node for which the restrictions
hold. This saves space, although of course at the expense of the small amount
of time it may take to trace up the tree to fetch restriction lists.
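
The space-for-time trade described here can be sketched as follows. The
sketch is in Python rather than LISP, with a dictionary standing in for
property lists; the markers, the property name RESTRICTIONS, and the
restriction values are assumptions made for the example.

```python
# Hypothetical piece of the relation tree, stored child -> parent.
PARENT = {"#BREAK": "#EVENT", "#CUT": "#EVENT", "#EVENT": "#RELATION"}

# Stand-in for LISP property lists: the restriction list is stored once,
# on the highest node for which it holds.
PROPERTIES = {"#EVENT": {"RESTRICTIONS": ["#CONCRETE", "#CONCRETE"]}}


def fetch_restrictions(marker):
    """Trace up the tree until a node carrying restrictions is found.

    Returns (restrictions, steps), where steps counts the parent links
    followed, i.e. the time paid for the space saved."""
    steps = 0
    node = marker
    while node is not None:
        restrictions = PROPERTIES.get(node, {}).get("RESTRICTIONS")
        if restrictions is not None:
            return restrictions, steps
        node = PARENT.get(node)
        steps += 1
    return None, steps


print(fetch_restrictions("#BREAK"))
print(fetch_restrictions("#CUT"))
```

Both #BREAK and #CUT find the same list after one step upward, so the list
is stored once instead of twice.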

In this chapter we have made some decisions about the ordering of concept
markers. First, concept markers were divided into objects, relations, and
properties. Two main orderings were presented: a tree to implement
selectional restrictions and a lattice as a primary ordering for deduction. A
secondary ordering was provided by fields, to relate concept markers to each
other along a dimension. Words were ordered by virtue of their association
with concept markers and according to their connotations. The static,
strictly hierarchic concept marker ordering proposed here would not be
adequate for a working translation system; however, the conceptual structure
is now well enough specified that we can go on to describe the semantic
component.




Chapter 4 -- Semantic Processing


4.1 An Overview

4.1.1 The Semantic Component

In the last chapter we considered a static, general structure, but we
have not yet discussed how a particular sentence relates to this general
framework. In this chapter and the next, I will try to remedy that situation.
As soon as the German grammar has successfully parsed some section of the
sentence, the semantic routines are called in. Their job is to construct a
semantic representation for each possible interpretation of the sentence. In
many places, the shape of a semantic representation might parallel the parse
tree, but at other points, the divergence will be obvious. Where the parse
tree is a record of syntactic relations, the semantic representation is an
independent structure to record semantic relations that are both implicit and
explicit in a sentence. The highly structured semantic representation
reflects systematic linguistic phenomena, and it is a step on the way from the
syntactic representation to the body of information that would be invoked by
the deductive component.

The general organization of the semantic component follows Winograd's
system, although there have been some fairly high level changes. In the
translation system, the semantic representation itself plays a prominent role.
Whereas in SHRDLU the representation is essentially an intermediate step in
the process of building theorems for deduction, here it is also important as
the input to the generator. To use the semantic representation in this way, I
have made a number of additions in terms of the information it contains,
especially in the direction of a more systematic treatment of thematic
features (section 4.9).




Where possible, word definition types have been standardized, so that the
individual word procedures can be used descriptively as well as imperatively.
That is, the definitions can be examined by the semantic component, which can
then take any appropriate actions before executing the procedure. A mechanism
has also been added to handle partial information, so that the semantic
component can return to bind the arguments of a relation as they are
encountered.
“Tre chapter What Pet catie hieiind ih tat idan put It ately 
Se et a eee eee et action 


nae ation. “The ‘actual 


; ‘anand igfid' this is considered in 
section 4:5. BSectiwn 4.7 Creuceses ‘wetkers Wee cade Seong ‘actions in the 


4.1.2 The Representation 

The semantic representation is constructed from three sorts of
components: object, relation, and property semantic structures (OSS, RSS, and
PSS, respectively). These components are bundles of information, and their
linkage reflects interrelationships within a sentence. In general, a semantic
representation is a network rather than a tree. By differentiating between
two sorts of linkage, however, we can always find an underlying tree structure
in a well-formed sentence. A sample representation is shown in Appendix D,
and I will spend most of this chapter discussing why it looks the way it does.


The diagrams that will appear below are only partial pictures of the
semantic representation, since the output of the semantic component contains
more information than is shown. In the machine version, information is hung
on the LISP property list of special atoms produced for the occasion.

A question should be raised here as to exactly what a semantic
representation should represent. While I can not really answer this until the
semantic component has been discussed in more detail, let me just distinguish
two levels of semantic information here: the propositional and thematic
levels. Basically the propositional level is related to what is said, the
thematic level to the way it is said. The thematic level deals with questions
like what information is important in a sentence, what the speaker wishes to
convey, and the assumptions he has about what his listener knows. The
semantic representation used here is a mixture of propositional and thematic
information, and I will come back to the question of what a semantic
representation should look like below.


4.1.3 Building the Semantic Representation

The semantic component, like Gaul, is divided into three parts: the semantic
specialist routines (SMSPEC), the semantic utility routines (SMUTIL), and the
dictionary definitions. The actual building of the representation is done by
the SMUTIL routines, and for much of the semantic representation it is the
dictionary definitions which make the calls to SMUTIL. The dictionary
definitions, in turn, are unleashed by SMSPEC after it has set up the
necessary calling parameters. It is the SMSPEC routines that are actually
called by the grammar, and many of these routines correspond to grammatical
constituents: there are SMCLAUSE, SMPREPG, and SMADJG. The noun group, on the
other hand, has a series of semantic specialists, SMNG1 through SMNG3. For a
noun group, the first of these is called once the noun has been found,
and it evaluates any prenominal modifiers starting with those closest to the
noun. Control then returns to the syntactic component to parse any
qualifiers, and then SMNG2 links these qualifiers to the noun. SMNG2 is also
responsible for evaluating relational nouns, which are not touched until the
entire noun group has been parsed. SMNG3 checks for reference to other parts
of the text if the noun group is definite. Also part of the noun group
package is SMCOMPOUND, which handles compound nouns that are not in the
dictionary but whose component elements are. SMPRON1 and SMPRON2 handle noun
groups that are pronouns.

Note that there is no separate semantic specialist for verbs. All the
actions necessary for verbs are done by the SMCLAUSE routine. By the time
SMCLAUSE is called, the subject, verb objects, and modifiers have all been
parsed. (This is true enough as far as it goes, but not completely true - see
section 4.7.1.) SMCLAUSE binds the relation specified by the verb to its
participants (subject + verb objects), then binds the modifiers to this
relation. This whole process is rather elaborate in practice, and the
discussion later on will shed some light on the kind of information SMCLAUSE
has to process.

Wherever the semantic specialists are called in the grammar, their main
role is to be yea- or nay-sayers. Whenever a semantic definition of a word is
bound, a check is made against the selection restrictions. If no semantic
representation can be built for the section parsed, then semantics returns
failure to syntax. In the system as it stands now, semantics does not touch
the parse tree except to get information from it, although the unimplemented
ideas in section 2.7.1 would involve a more active semantics.


4.2 Lexical Semantic Structures

4.2.1 The Standard Definitions

To get a better idea of the way a semantic representation is built up,
let us take a bottom-up approach and start with examples of the three standard
definition types.


    (DEFD ELEDONE
        SEMANTICS (
            (NOUN (OBJECT CONCEPT: #ELEDONE
                          CONNOTATIONS: +SCIENTIFIC
                          LABEL: D1))))

    (DEFD ...
        SEMANTICS (
            (VERB (RELATION CONCEPT: #GO-BEFORE-IN-TIME
                            TYPE: NONE ORDER: LEXPASS
                            ARGS: 2 LABEL: D2))))

    (DEFD ...
        SEMANTICS (
            (ADJ (PROPERTY CONCEPT: ...
                           CONNOTATIONS: + ...))))

The parts of the semantic definitions here are as follows.

The Selector:

The first entry in any semantic definition is a syntactic feature, which need
not necessarily be the part of speech; any distinguishing feature will do.
The syntactic features that were chosen in the course of the parse can thus be
used to eliminate semantic possibilities by matching against the first entry
in each semantic definition. For example, if schwimmen has been parsed as a
noun, there is no need to consider its meanings as a verb. If more than a
single feature is needed to discriminate between definitions, then a list of
features may be used. In addition, if more than one syntactic feature list
takes a particular semantic definition, then a list of distinguishing features
prefixed by EITHER may be used.
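
As a rough illustration of the selector test, here is a minimal Python sketch. It is not the system's LISP code: the function names, the tuple encoding of EITHER, and the two sample senses of schwimmen are all assumptions made for the example.

```python
# Hypothetical sketch of selector matching; names and data layout are assumed.

def selector_matches(selector, parse_features):
    """Return True if a definition's selector fits the features from the parse.

    A selector may be:
      - a single feature name, e.g. "NOUN"
      - a list of features that must all be present, e.g. ["VERB", "INF"]
      - a tuple ("EITHER", alt1, alt2, ...) whose alternatives are selectors
    """
    if isinstance(selector, str):
        return selector in parse_features
    if isinstance(selector, tuple) and selector and selector[0] == "EITHER":
        return any(selector_matches(alt, parse_features) for alt in selector[1:])
    return all(feature in parse_features for feature in selector)


def applicable_definitions(definitions, parse_features):
    """Keep only the definition bodies whose selector matches the parse."""
    return [body for selector, body in definitions
            if selector_matches(selector, parse_features)]


# Two made-up senses of "schwimmen", one selected by NOUN and one by VERB.
schwimmen = [
    ("NOUN", {"label": "D1"}),
    ("VERB", {"label": "D2"}),
]
senses_as_noun = applicable_definitions(schwimmen, {"NOUN", "NEUT", "SING"})
```

With the word parsed as a noun, only the definition selected by NOUN survives, so the verb meaning is never considered.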


The Routine Name:

OBJECT, RELATION, and PROPERTY routines are part of SMUTIL, and build OSS,
RSS, and PSS structures respectively. The definition supplies parameters for
these routines, labelled by the following keywords.


CONCEPT:
This is the semantic marker used in building up the semantic representation.
It is a part of the concept structure discussed in the previous chapter.


TYPE: (relations only)
This specifies the relation of the syntactic arguments to the semantic ones.
Types are ONE, TWO, THREE, NONE, etc., and they tell which of the semantic
arguments may be left understood in the surface representation. For example,
we may want certain relations to have an instrument specified at the semantic
level. The relation #CUT, then, would have three arguments - actor, patient
and instrument. In the sentence "Karl schnitt die Wurst mit einem Messer"
("Karl cut the sausage with a knife"), the definition used would be of type
NONE, since no arguments are left understood. For "Karl schnitt die Wurst"
("Karl cut the sausage"), a definition of type THREE would be used, since the
third argument - instrument - is left understood. Similarly, "Ein Messer
schnitt die Wurst" ("A knife cut the sausage") requires a definition of type
ONE, since the actor is left understood.
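
The effect of TYPE on the three-place #CUT relation can be sketched as below. This is only an illustration under stated assumptions: the role names follow the text, but the function, the dictionary of fillers, and the idea that roles have already been assigned to the surface constituents are hypothetical.

```python
# Hypothetical sketch: TYPE names the semantic argument left understood.

ROLES = ("actor", "patient", "instrument")      # arguments of #CUT, in order
UNDERSTOOD = {"NONE": None, "ONE": "actor", "TWO": "patient", "THREE": "instrument"}


def semantic_arguments(type_name, fillers):
    """Build the argument list for #CUT, with None for the understood argument.

    fillers maps role names to whatever the surface sentence supplied.
    """
    understood = UNDERSTOOD[type_name]
    args = []
    for role in ROLES:
        if role == understood:
            if role in fillers:
                raise ValueError(role + " is supplied but type says it is understood")
            args.append(None)
        else:
            if role not in fillers:
                raise ValueError(role + " is missing for type " + type_name)
            args.append(fillers[role])
    return args


full = semantic_arguments(
    "NONE", {"actor": "Karl", "patient": "Wurst", "instrument": "Messer"})
no_instrument = semantic_arguments("THREE", {"actor": "Karl", "patient": "Wurst"})
no_actor = semantic_arguments("ONE", {"patient": "Wurst", "instrument": "Messer"})
```

The three calls correspond to the three example sentences: all arguments present, instrument understood, and actor understood.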


ORDER: (relations only)
This is another way to supply information relating syntactic arguments to
semantic ones. Order is used for word pairs like precede and follow, or in
German vorgehen and folgen. We want these pairs to map into a single semantic
marker, and to do this, one of the pair is labelled as lexically active
(LEXACT) and the other as lexically passive (LEXPASS). When order is LEXPASS,
as in the example relation above, then what the syntax has labelled as subject
(in an active sentence) becomes the second argument of the semantic relation.
The decision about which word is the lexically active one and which the
passive one is arbitrary.
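
A small sketch of the ORDER convention follows. It assumes a two-place relation in an active clause; the function name and the sample fillers are invented for the illustration.

```python
# Hypothetical sketch of the LEXACT / LEXPASS convention for active clauses.

def order_arguments(order, subject, verb_object):
    """Return (first_argument, second_argument) of a two-place relation."""
    if order == "LEXACT":
        return (subject, verb_object)
    if order == "LEXPASS":
        # The syntactic subject becomes the second semantic argument.
        return (verb_object, subject)
    raise ValueError("unknown ORDER value: " + order)


# "A precedes B" and "B follows A" both map onto #GO-BEFORE-IN-TIME(A, B).
precede = order_arguments("LEXACT", "A", "B")
follow = order_arguments("LEXPASS", "B", "A")
```

Both verbs of the pair thus yield the same argument order for the shared semantic marker.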


ARGS: (relations only)
This is redundant information, since we can always recover the number of
arguments a relation takes, given its name (by checking the length of its
associated restrictions list). ARGS is specified in the definition anyway,
however, partly for efficiency reasons, and partly to help me keep track of
things when writing the dictionary definitions.
CONNOTATIONS:
This holds the connotation information discussed in section 3.7, which is
expected to be in binary form (+slang, +scientific, +technical, etc.). Right
now, this information is optional, and it is not used by the system until
generation.


LABEL:
If a word has more than one semantic definition, each is given a label. These
are used for error messages and cross-referencing with other information.


The system has a group of functions that pull information out of semantic
definitions [...] that the right global variables are [...]. What this kind
of definition searching amounts to is the ability to use definitions
descriptively as well as imperatively. This [...], since some information in
the definitions, like TYPE and ORDER, is really meant for use by the semantic
routines rather than to be actual semantic information. The definition
searchers [...] on the TYPE and ORDER information, however, so that it is
feasible to regard the definitions both descriptively and procedurally.

Another sort of information that should be in the definitions, but is not
now, is a measure of plausibility. This could consist of a number that
specifies the probability that a particular meaning is used, given that the
word occurs. Such plausibility measures appear worthwhile, but were not
tried here because of time limitations.


4.2.2 Other Definition Types

In addition to the three standard definition types, there are several
other dictionary functions: SPECIAL, SMNOMINALIZE, and SMREL-PART. A SPECIAL
definition type is for relations that do not fit existing definition
types. A SPECIAL definition just has a routine associated with it.
This sort of freedom is really valuable for exceptional cases like sein ("to
be"), but it has been used sparingly elsewhere in the system. Often, a
definition will start out as SPECIAL, but then another word like it will come
along, and eventually we have a class, warranting its own function type.
SMNOMINALIZE is a definition type designed to handle rankshift. Just as
we have clauses serving as rankshifted noun groups ("Ich weiß nicht, warum er
hier ist." / "I don't know why he is here."), there are also relations that
may be represented by nouns ("Das Stehen fällt ihm schwer." / "He has trouble
standing."). In both German and English, any verb infinitive may be a
rankshifted noun, and there are also words like "der Aufstand" ("the
uprising") whose semantic definitions are basically relations rather than
objects. In the semantic representation we want these to appear as relations;
but, since the syntax is a noun group, special actions have to be taken (for
example, to handle time). SMNOMINALIZE, then, is responsible for some of this
action directly and also acts as a signal to other noun-handling routines that
special treatment is in order.

SMREL-PART handles nouns that name a participant in a relation. For
example, the word Zeichen ("indication") can be defined as "something that is
acting as the first argument for the relation #INDICATE." The SMREL-PART for
Zeichen, then, builds up a piece of representation that may itself be
represented by Fig. 4.1.

[Figure 4.1: RSS for #INDICATE]

The #SOMETHING and #UNSPEC here are marked META because they are meta-
concepts, or concept variables. The #UNSPEC marker (for "unspecified" - see
section 4.7.2 below) would probably not be used here, since the information
would be given somewhere in the noun group, as it is in our sample text ("Ein
deutlich sichtbares Zeichen für die ... Erregungen" / "A clearly visible
indication of the ... excitations"). Both the SMREL-PART and the SMNOMINALIZE
facilities are attractive because they often mean we do not need a separate
concept marker for a relation or relation participant treated as an object.


4.3 Information in the Semantic Structures
Given the definition types above, the system builds semantic components
that contain the following information:

This is the name generated for the semantic node; no two node names are alike.


VARIABLE=
A variable is assigned to an individual instantiation of an object, relation,
or property. The node name may change in the course of building a
representation, but the variable remains the same.


CONCEPT=
This comes from the calling parameters, and it is our old friend the semantic
marker.


RESTRICTIONS= (PSS and RSS only)
Selection restrictions are hung from the conceptual structure and are
associated with a particular relation or property. Once the semantic utility
programs have retrieved a set of restrictions they are kept on the semantic
node for later reference.


LINKAGE= (RSS only)
This is a list of the participants in a relation, and it represents half of
the explicit linkage that holds the semantic representation together.


MODIFIERS=
Another mechanism for linking, this semantic feature binds OSS, RSS and PSS
components to their modifying relations. While LINKAGE values produce a tree
structure, when values from MODIFIERS are added in we can get a general
network structure. A semantic node with modifying relations is called the
"head" of these relations.


CASE=
CASE is set to a concept marker that is found above the CONCEPT on the
selection restriction tree and distinguished by a special tag. This is
redundant information, since the case is always derivable from the concept
marker. It is useful information to have around, however, especially for the
deductive component. CASE is set only for PSS's and RSS's that act as
modifiers, that is, those which are connected to the semantic representation
by MODIFIERS linkage. Sample cases are SPATIAL, TEMPORAL, and MANNER. The
use of the term "case" for this semantic feature might be misleading, and it
is discussed in more detail in section 4.18.


TYPE= (RSS only)
This is the TYPE information supplied by the dictionary routines. It is not
really semantic level information, but it is put into the representation
anyway for efficiency reasons.


ORDER= (RSS only)
See TYPE.


REFERENCE-SCOPE=
This is set to either GENERIC or PARTICULAR, depending on whether the
information given is about a particular object, relation or property, or about
the class thereof.


GIVEN-NEW=
This is set to GIVEN or NEW. See section 4.9.2 for an explanation.


COREF=
This gives a list of semantic structures that are coreferent with this one.
It is discussed further in section 4.8.


INFO-ORDER=
This is set to either UNMARKED or to a list of the modifying relations in a
clause in the order that they appeared in the surface structure. It is
discussed further in section 4.9.3.


CONNOTATIONS=
This is the connotation information from the dictionary definition.


THEME=
This is set to the semantic node that corresponds to the theme of the clause.
See section 4.9.3 for an explanation.


RESTRICT-DESCRIBE=
This is set to either RESTRICT or DESCRIBE. See section 4.9.3 for further
discussion.


CLAUSE-TYPE=
This is set to COMMAND, QUESTION, STATEMENT, or SUBORDINATE. See section
4.9.3 for an explanation.


PARSENODE=
This is set to the parse-node that supplied the concept, if there is one.


PARALLELS=
This is used in representing a variety of coreference. See section 4.8 for
details.
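
To make the register list more concrete, here is a minimal Python sketch of a semantic node carrying a few of the registers above. The class, its field names, the node names, and the helper function are assumptions for illustration only; the system itself stores these as LISP properties.

```python
# Hypothetical sketch of a semantic node with a subset of the registers.
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)  # compare by identity; the structure may contain cycles
class SemanticNode:
    name: str                       # unique node name
    kind: str                       # "OSS", "RSS" or "PSS"
    concept: str                    # semantic marker, e.g. "#SCIENTIST"
    variable: Optional[str] = None
    linkage: List["SemanticNode"] = field(default_factory=list)    # RSS only
    modifiers: List["SemanticNode"] = field(default_factory=list)
    case: Optional[str] = None


def attach_modifier(head, relation):
    """Record relation as a modifying relation of head (its 'head' link)."""
    head.modifiers.append(relation)


# "the nearsighted scientist": LINKAGE gives a tree, MODIFIERS closes a loop.
scientist = SemanticNode("OSS1", "OSS", "#SCIENTIST", variable="X1")
nearsighted = SemanticNode("PSS1", "PSS", "#NEARSIGHTED")
has_property = SemanticNode("RSS1", "RSS", "#HAVE-PROPERTY",
                            linkage=[scientist, nearsighted])
attach_modifier(scientist, has_property)
```

The relation points to its participants through LINKAGE, while the head noun points back to the relation through MODIFIERS, which is what turns the tree into a general network.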


4.4 Non-Lexical Entries in the Semantic Representation
The word definitions discussed above form an important part of the
semantic representation. Not all entries in the representation, however, are
formed by words - some entries represent relations implicit in the syntax of
the sentence. Some examples of this will be discussed here - e.g. the
postnominal genitive, adjectives that modify nouns, and compound nouns. These
are closely-related cases and by no means exhaustive, but they give a
representative indication of the issues involved. Wherever there is an
implicit relation, it is the SMSPEC routines that supply it and make the call
to SMUTIL.


4.4.1 The Genitive
Starting off with an example, the semantic representation for the
genitive construction "der Regenschirm der Dame" ("the lady's umbrella,"
literally, "the umbrella of the lady") might look like Figure 4.2.


Figure 4.2

For "das Auge des Hummers" ("the eye of the lobster") we might have Figure
4.3.


Figure 4.3


In the diagrams, "head" is used to indicate that the linking of the two OSS's
is done using the register MODIFIERS, rather than the LINKAGE register. In
the two examples, the OSS's are formed in response to the nouns in the
phrases, but the RSS reflects an implicit relation. (Actions taken for the
determiners have been left out of this initial pass for the sake of
simplicity.) The two phrases give no explicit clues to guide the choice
between #OWN and #HAVE-AS-PART as interpretations. What's more, there are a
number of other possible relations that may be implied by the genitive, e.g.:


    der Geruch des Käses = the smell of the cheese
        aspect of a thing + thing

    das Buch des berühmten Poeten = the famous poet's book
        creation + creator

    die Stadt meiner Geburt = the city of my birth
        aspect of a relation + relational noun

    die Geschichte meines Lebens = the story of my life
        subject matter
There are many such relations that can be expressed using the genitive, but
the possibilities here are not completely open. For example,

    die Pflanze meines Schreibtischs / the plant of my desk

cannot be construed to mean the plant that is on my desk. To say this, both
German and English use a preposition (auf / on) to explicitly express the
spatial relationship. Thus, if the number of relations that are implicit in
the genitive is bounded, as I believe to be the case, it makes sense to talk
about producing semantic representations for the different possibilities. Some
of the possibilities are in fact constructed by the system, using selection
restrictions as a filter to block the blatantly impossible combinations. The
next step is to take a closer look at these possible representations using the
deductive component; but let us first finish up the discussion of how to build
semantic representations before going into the question of how to choose
between them.
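
The use of selection restrictions as a filter over a bounded set of genitive relations can be sketched as follows. Everything concrete here is an assumption for illustration: the fragment of the concept tree, the restrictions attached to #OWN and #HAVE-AS-PART, and the function names.

```python
# Hypothetical sketch: filter candidate genitive relations by restrictions.

PARENT = {  # made-up fragment of a selection restriction tree
    "#LADY": "#HUMAN",
    "#HUMAN": "#ANIMATE",
    "#LOBSTER": "#ANIMAL",
    "#ANIMAL": "#ANIMATE",
    "#ANIMATE": "#PHYSICAL-OBJECT",
    "#UMBRELLA": "#ARTIFACT",
    "#ARTIFACT": "#PHYSICAL-OBJECT",
    "#EYE": "#BODY-PART",
    "#BODY-PART": "#PHYSICAL-OBJECT",
}

# relation -> (restriction on the genitive noun, restriction on the head noun)
GENITIVE_RELATIONS = {
    "#OWN": ("#HUMAN", "#PHYSICAL-OBJECT"),
    "#HAVE-AS-PART": ("#PHYSICAL-OBJECT", "#BODY-PART"),
}


def is_a(concept, ancestor):
    """True if ancestor is concept itself or lies above it in the tree."""
    while concept is not None:
        if concept == ancestor:
            return True
        concept = PARENT.get(concept)
    return False


def genitive_readings(head, genitive):
    """Return the relations whose restrictions both nouns satisfy."""
    return [relation
            for relation, (first, second) in GENITIVE_RELATIONS.items()
            if is_a(genitive, first) and is_a(head, second)]


umbrella_of_lady = genitive_readings("#UMBRELLA", "#LADY")
eye_of_lobster = genitive_readings("#EYE", "#LOBSTER")
eye_of_lady = genitive_readings("#EYE", "#LADY")
```

Under these made-up restrictions the filter blocks the impossible readings but can still leave more than one candidate, as for the last phrase; choosing among the survivors is left to the deductive component.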


4.4.2 Noun Modification
Another example of a semantic relation that is implied by a grammatical
structure is #HAVE-PROPERTY. Each property used to modify an object (or a
relation or another property, for that matter) is given a #HAVE-PROPERTY
relation. Thus, "der kurzsichtige Wissenschaftler" ("the nearsighted
scientist") would be represented by Figure 4.4. The #HAVE-PROPERTY relation
is special, in that the selection restriction for its first argument is found
associated with its second. In the example, the restriction #LIVING-THING is
associated with #NEARSIGHTED, so our scientist would pass the test. Instead
of conventional selectional restrictions, therefore, #HAVE-PROPERTY has a
procedure, which is automatically evaluated by the restriction checking
procedure. The restriction code for #HAVE-PROPERTY retrieves the selection
restriction from its second argument and uses it to perform the check on its
first.


Figure 4.4
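
The procedural restriction on #HAVE-PROPERTY might be sketched like this. The concept tree fragment and all names are assumptions; only the pairing of #NEARSIGHTED with #LIVING-THING comes from the text.

```python
# Hypothetical sketch: the restriction on the first argument of
# #HAVE-PROPERTY is looked up on its second argument, the property.

PARENT = {  # made-up fragment of a selection restriction tree
    "#SCIENTIST": "#HUMAN",
    "#HUMAN": "#LIVING-THING",
    "#UMBRELLA": "#ARTIFACT",
}

PROPERTY_RESTRICTIONS = {"#NEARSIGHTED": "#LIVING-THING"}


def is_a(concept, ancestor):
    """True if ancestor is concept itself or lies above it in the tree."""
    while concept is not None:
        if concept == ancestor:
            return True
        concept = PARENT.get(concept)
    return False


def check_have_property(modified_concept, property_concept):
    """Fetch the restriction from the property and test the modified concept."""
    restriction = PROPERTY_RESTRICTIONS[property_concept]
    return is_a(modified_concept, restriction)


scientist_ok = check_have_property("#SCIENTIST", "#NEARSIGHTED")
umbrella_ok = check_have_property("#UMBRELLA", "#NEARSIGHTED")
```

The scientist passes because #SCIENTIST lies below #LIVING-THING in this toy tree, while an umbrella would be rejected.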


One issue in noun modifier representation should be mentioned here. In
German, as in English, adjectives may be stacked up in front of nouns in two
ways: serially or in parallel. The parallel version is the one that often has
a comma: "die kleine, lebhafte Eledone" ("the small, active eledone"). The
representation for this is straightforward, as shown in Figure 4.5. Serial
adjective lists, on the other hand, pose problems of representation. The
modifying relations for something like "der nette alte Mann" ("the nice old
man") could be represented by:

    (der (nette (alte Mann)))

That is, each adjective modifies the entire remainder of the phrase. The
correct representation, therefore, would seem to be that shown in Figure 4.6.
This sort of representation is accurate, but I think the proliferation of
OSS's would cause a great deal of extra effort for the generator. Another
sort of linkage would be necessary to relate the different OSS's of a noun
group, and representations would be larger. To avoid this, the system
abbreviates the representation of Figure 4.6 slightly, using the modifier
relation established between adjective and noun instead of a new OSS. The
result is shown in Figure 4.7.


Figure 4.7

This representation might break down if modifying relations themselves have a
lot of modification, but this would only be happening for prenominal clauses,
not #HAVE-PROPERTY relations, and clauses will probably not be stacked up
serially more than two deep.

I should note that only the simplest sort of adjectives are currently
handled by the semantic component. For an analysis of some of the
complexities involved in English adjectives, see Vendler (36).


4.4.3 Compound Nouns
The German compound noun is often translated by a classifier plus noun in
English (for example, "die Feuerwehr" / "the fire department"). Among the
relations that occur between the parts of German compounds are relations
discussed for the genitive and simple noun modifiers, so the representation
described in the last two sections is also applicable here. With compounds,
we also get relations that could be expressed using prepositions, such as:

    die Gummistiefel = Stiefel aus Gummi
        rubber boots (material of)
    die Trinkgläser = Gläser zum Trinken
        drinking glasses (use or function of)
    die Todesanzeige = eine Anzeige wegen des Todes
        death notice, obituary (occasion of)
    die Seereise = eine Reise auf der See
        sea voyage (place of)
Especially with the more common implicit relations, we would like to be
able to handle compounds that are not in the dictionary, but whose parts are.
The system does this in the routine SMCOMPOUND. In a procedure analogous to
that for genitives, the semantics of the component words are bound to a group
of possible relations, selection restrictions permitting, of course. The
representation produced looks either like the output of the genitive routine
or like representations of other noun modifiers. This approach is desirable
because the representation resembles those built for similar structures, the
dictionary is not loaded with redundant definitions, and the system is able to
cope with new compounds it has not seen before.

The drawback of using the semantic component to supply implicit relations
from a built-in set is that the range of relations possible between the
compound's components is much wider than the range of relations for either
genitives or simple noun modifiers. I suspect that it is an open set, and
this would mean that certain word pairs could have implicit relations that are
completely idiosyncratic. If a relation appears only in one compound, it
obviously does not belong in SMCOMPOUND. This seems to be the place for a
dictionary definition, and in fact the SPECIAL definition facility in the
system could handle the situation with no trouble. An RSS for the implicit
relation could be built in the definition, binding the compound's components
as participants. This is clearly an efficient approach, as long as the system
has a definition for each idiosyncratic compound it encounters.

Between the compounds formed from a predictable set and the completely
idiosyncratic compounds are a group that show some regularity, although the
particular relations involved are unpredictable. These are given
representations by the semantic component, but an #UNSPEC (for "unspecified")
marker is used. The marker is discussed below in section 4.7.2, and the
compound class is investigated in more detail in section 5.5.

Finally, there are compounds for which the meaning of the whole is
different from the sum of the meanings of its parts. Consider "der
Tintenfisch" ("the cuttlefish" or "squid"; literally, "ink fish"). Leaving
aside the point that the squid is not a fish to a biologist, we note that "der
Tintenfisch" refers not just to any water animal that spews an inky cloud, but
to the cuttlefish. If there is some other fish-like creature that also spews
ink, it would not be designated by Tintenfisch. Such a situation seems to
warrant a separate concept like #SQUID, and so the representation for this
type of compound looks no different from that of a regular noun.

For representing compounds, then, the system offers four alternatives:
pre-packaged implicit relations, special dictionary definitions,
representations that leave the relation unspecified for the time being, and
standard object definitions for compounds whose meanings are more than the sum
of the meanings of their parts.

This last group of compounds mentioned raises a question. Do not all
compounds, in fact, tend to be more than the sum of their parts? When faced
with "Gummistiefel" ("rubber boots"), we know something about this special
type of footwear, just as we have specific information about "Schnürstiefel"
(literally "lacing boots" - any boot that has a shoelace) and "Holzschuhe"
("clogs"). We would want to associate this information with a specific
concept marker, rather than the more general concept #FOOTWEAR. Given the
system we have now, if the representation for "Gummistiefel" is supplied by
SMCOMPOUND, there is nowhere to put the information. This is a problem, but
it is one that the system does not have to face, since it is not intended to
do any learning (i.e. it is assumed that the information in a sentence would
never be used to permanently change the deductive data base).

If we wanted to allow learning in the system, we might try the following
approach. When a new compound is encountered, the implicit relation could be
selected by SMCOMPOUND, but then instead of adding this to the semantic
representation, we could create a new concept marker. For example, a new
"Gummistiefel" concept would have #FOOTWEAR as genus and something like
(#MATERIAL-OF X #RUBBER) as differentia. Then any new information learned
about rubber boots, e.g. that they are worn in the rain, could be associated
with the new concept. In addition, we could add a dictionary definition for
Gummistiefel, so that the next use of the word would invoke the new concept.
It would only be economical to build new concepts, of course, if they are
really useful for organizing information in the deductive data base. There
would therefore have to be some criterion for how much special information is
needed to justify the creation of a separate concept marker. The creation of
new concept markers would also affect generation, and we would need a facility
to update dictionary definitions for the generator accordingly.
Since, however, we are not trying to describe a framework for learning,
there is no need to generate new concept markers. The compromise used in the
system was to give concept markers to compounds that have information
associated with them, like "das Nervensystem" / "the nervous system". Words
for which the bulk of special information would probably be encountered in
the input text for the first time - like "das Chromatophorenspiel" / "play of
the chromatophores" - are represented by chunks of semantic representation
(relation + participants). These are constructed by SMCOMPOUND, [...]
relation is involved.


It is thus necessary to supply implicit relations for genitives,
adjectives, and compounds in the noun group in order to develop
representations for the different possibilities. In general, only the
deductive component can decide between the set of different representations
produced.


4.5 Words without Semantic Representations
In the last section we looked at relations that were not tied to
specific words, but now the question is whether an entry in the semantic
representation must be formed for all words. In fact, not all words add to
the representation; generally, words that do not are low in semantic content
and very high in syntactic function. The most common case of this is
prepositions that mark participants in relations. For example:

    Die Eledone reagierte auf den Reiz.

    The eledone reacted to the stimulus.
Such prepositions occur both with verbs and adjectives, but only the former
will be discussed here. Obviously not all prepositions are of this type;
those that mark location, time, causality, etc. are high in individual
semantic content and not dependent, except in the most general way, on the
particular verb used. The prepositions I am considering are those closely
tied to individual verbs and whose functions are performed by case in other
situations: "mir gefällt es" versus "ich freue mich darüber" in German
(roughly "I like it" and "I am happy about it"), "look at" versus "observe" in
English. Where such prepositions occur, their semantics is essentially a no-
op, i.e. the semantics of the prepositional object is stuck into a register
marked by the preposition name. When the relation associated with the verb is
evaluated in SMCLAUSE, a list of its required prepositions is retrieved from
the collocations list. These prepositions are indexed by semantic definition
labels, since a difference in preposition can indicate a difference in
semantics ("dient als" / "serves as" versus "dient zu" / "serves to"). For
each definition, then, we know just where to look for the semantics of its
participants. In the case of a required preposition, we just pick up the
object's semantics from the register that was set. Note that essentially the
same procedure is followed for separable prefixes, since a separable prefix
verb is considered to be one word. This uniformity is desirable in light of
the close relation between prepositions and separable prefixes.

It is not, perhaps, entirely fair to say that prepositions that mark
objects of the verb are devoid of semantic content. In fact, I was surprised
at the regularities that I encountered in the course of organizing the
selection restriction tree. Fillmore's case theory (6), of course, is partly
based on this sort of regularity. As an example, the object of a mental
process (including in this perception) is often marked by an:

    (a) Ich denke oft daran.
        I think of it often.
    (b) Ich erinnere mich daran.
        I remember it.
    (c) Das ist an Sepia zu beobachten.
        That can be observed in Sepia.

In a sense, these regularities are not surprising, since people have to
remember which prepositions go with which verbs, and the more regularities,
the better. On the other hand, the situation is complicated by the fact that
the same preposition may be used with a rather wide variety of verbs in a
number of different ways. Thus we have for auf:

    (d) Er wirkte auf das Publikum.
        He had an effect on the public.
    (e) Er reagierte nicht darauf.
        He did not react to it.
    (f) Wir haben stundenlang auf ihn gewartet.
        We waited for him for hours.

The semantics of these required prepositions, then, can give a clue to the
relation between subject and verb object, but the clue is never
definitive without the evidence supplied by the verb.
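
The register-and-collocations lookup described earlier in this section can be illustrated with a short sketch. The table layout, the labels D1 and D2, and the function names are assumptions; only the contrast between "dient als" and "dient zu" is taken from the text.

```python
# Hypothetical sketch: required prepositions indexed by definition label.

COLLOCATIONS = {
    "dienen": {"D1": "als", "D2": "zu"},   # labels are invented for the example
}


def licensed_definitions(verb, registers):
    """Return the definition labels whose required preposition was seen.

    registers maps a preposition name to the semantics of its object,
    as stored when the preposition group was processed.
    """
    required = COLLOCATIONS.get(verb, {})
    return [label for label, prep in required.items() if prep in registers]


def participant_semantics(verb, label, registers):
    """Pick up the object's semantics from the register for this definition."""
    return registers[COLLOCATIONS[verb][label]]


# A clause with "dient zu ...": only the register named "zu" has been set.
registers = {"zu": "semantics-of-zu-object"}
labels = licensed_definitions("dienen", registers)
participant = participant_semantics("dienen", labels[0], registers)
```

The preposition itself contributes nothing to the representation here; it only names the register and selects among the verb's definitions.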


4.6 Idiomatic and Special Usages
Another situation in which the words in a clause might not map one-to-one
onto the semantic representation is when idioms are present. I am using
"idiom" here in the semantic sense to mean any phrase whose meaning is
different from the sum of the meanings of its component words. A semantic
representation for an idiom, then, is properly associated with the phrase
itself. In the system, the most general way to handle an idiom is with a
SPECIAL definition. For something like:

    Das Eisen schmieden solange es heiss ist.
    Strike while the iron is hot.

we could write a SPECIAL definition for schmieden ("strike"), that would check
for the presence of the rest of the phrase, then provide a special semantic
representation to embody a meaning like "Act while there is an opportunity."
Note that the system does not deal with such idioms right now, because none
are present in the sample paragraph.


The system does handle more restricted idiomatic usages in two ways:
through the collocations list and through the selector mechanism. The
selector was mentioned above in section 4.2.1. Its purpose is to cross-
reference semantic definitions with syntactic features. A facility not
mentioned above is the appearance of words, as well as features, in a
selector. In such a case, the semantic definition is applicable only if the
next word in the sentence matches this word.

The word selector facility is obviously limited. It would not be
included in the system if it did not come essentially "for free." The
collocations list is potentially of greater generality, although right now it
is used only for associating prepositions with verbs, as described in the last
section. In the collocations list, we can index the meaning of a verb by a
preposition or separable prefix elsewhere in the clause. It would not be
difficult to extend this mechanism to handle idioms like our "Strike while the
iron is hot" example, and I suspect that this would be a good way to proceed.
Writing a SPECIAL definition for every idiom that we want to add to the system
would be time consuming, and the code mostly repetitious. The best policy
seems to be to use SPECIAL definitions sparingly, and to use the collocation
mechanism to reflect the regularities that can be found.


4.7 Special Entries in the Semantic Representation
Several special concept markers are used in the semantic representation
as it is built up. These are the concept variables #SOMETHING, #UNSPEC,
#UNBOUND, and #REFERENT. The first, #SOMETHING, helps represent nouns that
name a participant in a relation, and its use was illustrated in section
4.2.2. The three other metaconcepts, which will all have been replaced by the
time that the generator gets the semantic representation, are the subject of
this section.


4.7.1 The #UNBOUND Flag
The #UNBOUND marker is a temporary placeholder which disappears by the
time the semantic component finishes its work. The purpose of this marker is
to allow evaluation of relations before all their participants have been
bound. This is not in any way a theoretical necessity, since we can always
wait until all the participants are in before evaluating a relation. I find
it a satisfactory solution, however, because it keeps the semantic component
[...] as soon as a phrase has been parsed.


As an example, consider the phrase "die im Nervensystem verlaufenden
Erregungen" ("the excitations that run through the nervous system";
literally, "the through the nervous system running excitations"). Since we
want to handle the prenominal clause "im Nervensystem verlaufenden" as soon as
it is parsed, SMCLAUSE produces the representation in figure 4.8 (and maybe
others, of course, for the alternative interpretations). Note that the
prepositional phrase "im Nervensystem" was also handled using #UNBOUND, but
the processing for the prenominal clause has already done the binding there.
The representation shown is left at the clause node, where it sits until the
main noun Erregungen is parsed and SMNG1 is activated. SMNG1 calls the
SMMODIFIERS routine, which is, as its name implies, a general clearing house
for modifiers of nouns, verbs, adjectives, and the rest. SMMODIFIERS takes
note of the fact that the modifier is a prenominal clause, so it knows that an
#UNBOUND needs to be replaced by the OSS for the noun. Seeing this, it calls
REBIND to make a selection restrictions check, do the binding, and supervise
any renaming that is necessary to keep the semantic representation consistent.
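
The binding step just described can be illustrated with a small sketch. It is
written in Python purely for illustration and is not the code of the system:
the Relation class, the marker names #RUN-THROUGH, #EVENT and #PHYSOB, and the
OSS names are all assumptions made for the example. The sketch shows only the
idea of evaluating a relation with a placeholder and filling the slot later,
after a selection restriction check.

```python
from dataclasses import dataclass

UNBOUND = "#UNBOUND"  # placeholder for a participant that is not yet available


@dataclass
class Relation:
    concept: str        # relation concept marker (hypothetical name)
    participants: list  # OSS names, or the UNBOUND placeholder
    restrictions: list  # one required feature per participant slot


def rebind(relation, oss_name, oss_features):
    """Fill the first #UNBOUND slot with oss_name.

    Returns a new Relation, or None if the selection restriction
    for that slot is not satisfied by oss_features.
    """
    for i, participant in enumerate(relation.participants):
        if participant == UNBOUND:
            if relation.restrictions[i] not in oss_features:
                return None  # selection restriction check failed
            bound = list(relation.participants)
            bound[i] = oss_name
            return Relation(relation.concept, bound, relation.restrictions)
    return relation  # nothing left to bind


# "im Nervensystem verlaufenden": evaluated before the head noun is parsed.
clause = Relation("#RUN-THROUGH",
                  [UNBOUND, "OSS-NERVENSYSTEM"],
                  ["#EVENT", "#PHYSOB"])

# Later the head noun "Erregungen" is parsed and its OSS becomes available.
result = rebind(clause, "OSS-ERREGUNGEN", {"#EVENT"})

# A noun whose features fail the restriction is rejected.
rejected = rebind(clause, "OSS-STEIN", {"#PHYSOB"})
```

The renaming that REBIND supervises is left out of the sketch; returning a
new Relation instead of changing the old one is a design choice of the
example, not something the text specifies.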


Besides prenominal clauses, the #UNBOUND mechanism is used for preposition
groups (since prepositions are generally represented by two-place relations),
for subordinate clauses of various sorts, and for those adjective groups that
are represented as relations. Although in some cases all constituents for
participants of the relation will have been parsed, usually not all will have
been evaluated semantically (for example, the main verb). Therefore, the use
of the #UNBOUND mechanism is justified in most cases, and to keep things
uniform, I have used it throughout. This sort of feature is probably also
useful in interpreting English, assuming that several other high level
decisions are also kept. The special nature of German syntax (prenominal
clauses, end-order clauses), however, makes some sort of partial binding
mechanism essential.


4.7.2 The #UNSPEC Marker

#UNSPEC, for "unspecified," is an escape hatch for the semantic component; it
is a placeholder for some of the information that is left understood in the
utterance. Some #UNSPEC markers may be replaced when noun group reference is
determined, but others are left for the deductive component to mull over. An
example should give a better idea of what #UNSPEC is used for.

One place for this sort of marker is in relational nouns. Often time is left
for the reader to fill in, as in

(a) Karl erinnerte sich an das Rennen.
    Karl remembered the race.

(b) Karl freute sich auf das Rennen.
    Karl looked forward to the race.

In (a), the race predates Karl's mental action; in (b), Karl's mental process
comes first. Further, for many kinds of relational nouns, one participant is
left out, frequently the agent. In a lot of cases, the agent is understood to
be the universal anyone, for example, "Schilaufen kann gefährlich sein"
("Skiing can be dangerous", i.e. anyone who skis can find it dangerous). Not
all such constructs imply "anyone," however. Some agents can be unique, as in
"der Erfinder der Buchdruckerkunst" ("the inventor of printing"). Other
understood agents can only be determined with respect to the context: "das
Abschalten des Stroms" ("the cutting off of the power") could be done by a
homeowner, a company, or a repairman. The #UNSPEC marker can be used, then,
to defer the decision until more information is available. Another use for
#UNSPEC is when the grammatical passive has no agent ("Der Strom wird
abgeschaltet" / "The power is being cut off"). We have the same range of
possibilities here as in the relational noun with unspecified agent.


4.7.3 #UNSPEC for Ellipsis

The #UNSPEC uses above are mainly triggered by syntax. In some other
situations where information is left out, there seems to be a lexical basis
for the deletion. In our paragraph, for example, there is discussion of
whether octopuses can perceive colors. After giving evidence that supports the
existence of color perception, the author says, "... zum mindesten für diese
Formen für einen Farbensinn zu sprechen" ("These things, then, at least for
these species, seem to support a color sense"). Here the author has
substituted a noun group for a relation such as "the existence of" plus the
noun group. It seems to me that abbreviations like this depend very much on
the special senses of particular words (with possibly some grouping into
classes of words that allow similar types of ellipsis). In this example, the
ellipsis might be triggered by "sprechen für" ("support"). For these cases,
then, it will be up to a SPECIAL definition routine to introduce the #UNSPEC
marker and do the necessary binding. Deduction can then decide what relation
is understood. This definition approach guarantees that the system can spot
special cases and know what it needs to bind.


4.7.4 The #REFERENT Marker for Pronouns
The #REFERENT marker is used for third person pronouns, since these have no
concept markers to call their own. This reflects the fact that for the
semantics of a pronoun, we are totally dependent on information from the
coreferent noun group (or on our knowledge of the actual referent, as for the
first and second person). The #REFERENT marker is supplied by pronoun
definitions, and it is similar to #UNBOUND in that it has been replaced by the
time the semantic representations reach the deductive component. The
mechanism for handling #REFERENT is also functionally similar to that for
#UNBOUND, in that the same sort of rebinding is done.

Let us look at the use of the #REFERENT marker in more detail. First, for
things like personal and relative pronouns, #REFERENT might be replaced almost
immediately. SMNG1 causes evaluation of the pronoun definition, setting up
the #REFERENT marker. SMNG2 probably will not be called, as we generally will
not have qualifiers following these pronouns. SMPRON is then called to handle
reference. Its job is to construct a list of possible referents (using
heuristics taken with little change from Winograd's system) and to eliminate
those that do not agree with the pronoun syntactically (on the basis of gender
and number). The #REFERENT OSS is then rebound to each of these possible
referents. As semantic processing continues, some of these will probably be
eliminated by selection restriction checks, and the final coreference
decisions will be made by the deductive component.
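
The syntactic filtering step attributed to SMPRON above can be sketched as
follows. This is a Python illustration under stated assumptions, not the
routine itself: the NounGroup record, its field values, and the
"most recent first" ordering are all invented for the example, and the
heuristics taken from Winograd's system are not reproduced.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NounGroup:
    oss: str     # name of the semantic node (hypothetical naming)
    gender: str  # "masc", "fem", or "neut"
    number: str  # "sing" or "plur"


def possible_referents(pronoun_gender, pronoun_number, preceding_groups):
    """Return the noun groups that agree with the pronoun in gender
    and number, most recent first (the ordering is an assumption)."""
    return [ng for ng in reversed(preceding_groups)
            if ng.gender == pronoun_gender and ng.number == pronoun_number]


groups = [
    NounGroup("OSS-HOEHLE", "fem", "sing"),   # die Höhle
    NounGroup("OSS-STEIN", "masc", "sing"),   # der Stein
    NounGroup("OSS-FARBE", "fem", "sing"),    # die Farbe
]

# Candidates for a feminine singular pronoun such as "sie".
candidates = possible_referents("fem", "sing", groups)
```

Each surviving candidate would then be a target for rebinding the #REFERENT
OSS; the later elimination by selection restrictions and by deduction is
outside the scope of the sketch.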
From what has been said so far, the reader might conclude that the #REFERENT
marker isn't necessary in every case. It is true that if the pronoun OSS is
rebound immediately, then the marker is an extra step. Even personal
pronouns, however, can make forward references, which are enough to justify
the marker. Consider the example:

Ehe er immatrikulieren kann, muss Johann die Aufnahmeprüfung machen.
Before he can enroll, Johann has to take the entrance exam.

Here, the #REFERENT marker is inserted for er, and it is not rebound to the
OSS for Johann until SMCLAUSE is called for the major clause. Another
situation where the #REFERENT marker is justified is the da-compound, where
forward reference is frequent. In the sample paragraph we have:

... was ebenfalls sehr dafür spricht, dass ... Farben zu unterscheiden
vermögen ...
... which again very strongly suggests that some animals are able to
distinguish colors ...

Here, dafür refers forward to the "dass" clause. For all sorts of pronouns,
the #REFERENT marker frequently remains as one possible interpretation until
the end of the sentence, waiting for the referent to be found.


We will want some way to represent coreference. Since coreference in full
noun groups will be represented like pronoun coreference, both are discussed
together in the next section.


4.8 Representing Coreference

It seems desirable for pronouns and definite noun groups (more properly, for
all noun groups that are coreferent with other noun groups in the text) to
have similar semantic representations. By the time the generator sees the
representation, there will be no explicit indication of whether the surface
structure contained a pronoun or a full noun group. This makes sense, since
the target language has its own rules for coreferent noun groups and pronoun
insertion, and these may or may not coincide with the rules in the source
language. In this section, I will discuss finding the coreferent noun group,
the representation built, and issues of pseudo-coreference. Although I will
speak of a noun group "referring" to another noun group, I really mean that
the two are coreferent. Only extralinguistic things (or quoted words or
phrases in linguistic discussion) are actually referred to by noun groups.
Note that references can be made to other constituents by using pronouns like
"es." The discussion below will concentrate on noun group coreference, but
the situation is much the same when a relation is involved.

For pronouns, we saw that the semantic component accumulates a list of
possible coreferent structures. For full noun groups the situation is
slightly different, and the approach taken differs from Winograd's. Pronouns
are so weakly specified semantically that they cannot be separated from their
referents by a great distance. Full noun groups, however, are often better
specified, and a referent could potentially be found anywhere in the text.
While I suspect that references outside a paragraph are limited to certain key
noun groups, I also think that determining these noun groups is non-trivial,
i.e. not obvious from surface structure in every case. So to proceed for full
noun groups as we did for pronouns - constructing a possibilities list and
narrowing it - will not be feasible. Even if we were to limit our search for
referents to the scope of a paragraph, our possibilities list would not be
very interesting, because we have no good way to narrow down the
possibilities. In some situations we might be able to use selection
restrictions to narrow possibilities, but in general they will not be
adequate. Furthermore, coreferent full noun groups need not agree with their
referents in gender and number. We would have a potentially bulky
possibilities list and nothing to do with it. For this reason, it is left to
the deductive component to determine which full noun groups are coreferent.
Although the deductive, rather than the semantic component, will be adding
coreference information to the semantic representation for full noun groups, I
would like to finish up the discussion of coreference at this time.
The main feature of the representation is that coreferent items are given
the same variable (register setting: VARIABLE), although their OSS or RSS
names differ. Pronoun semantic nodes will always take their CONCEPT setting
from the coreferent noun group when #REFERENT is rebound. A full noun group,
on the other hand, always supplies its own concept marker. Given a variable
name alone, it is not always easy to find other semantic nodes sharing the
same variable, so the register COREF is set to a list of all the coreferents
of this node. That is, when we find a back reference, we set the COREF
register both on the referring semantic node and on the node referred to. This
will be useful for generation, since a back reference in one language might be
better translated as a forward reference in another. Figure 4.9 shows some of
the information that will be present in the OSS's of the two coreferent noun
groups: "the cephalopod" and "this animal."

Note that other registers are set separately for coreferent semantic nodes, so
that they may, and often will, differ in MODIFIERS, GIVEN-NEW, etc.
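
The register settings described above can be sketched in a few lines. The
sketch is in Python for illustration only; the SemanticNode class, the node
names OSS1 and OSS2, the variable names X1 and X2, and the concept markers are
assumptions. It handles a single pair of nodes, whereas a full COREF list
would have to cover every coreferent of a node.

```python
class SemanticNode:
    """A semantic node reduced to the three registers discussed here."""

    def __init__(self, name, concept, variable):
        self.name = name
        self.registers = {"CONCEPT": concept,
                          "VARIABLE": variable,
                          "COREF": []}


def mark_coreferent(referring, referred_to):
    """Give both nodes the same VARIABLE and record each node in the
    other's COREF register. Other registers are left untouched."""
    referring.registers["VARIABLE"] = referred_to.registers["VARIABLE"]
    if referred_to.name not in referring.registers["COREF"]:
        referring.registers["COREF"].append(referred_to.name)
    if referring.name not in referred_to.registers["COREF"]:
        referred_to.registers["COREF"].append(referring.name)


# "the cephalopod" ... "this animal": two full noun groups.
cephalopod = SemanticNode("OSS1", "#CEPHALOPOD", "X1")
animal = SemanticNode("OSS2", "#ANIMAL", "X2")
mark_coreferent(animal, cephalopod)
```

Because both nodes carry the COREF entry, a generator could start from either
one, which is what makes it possible to turn a back reference into a forward
reference.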

There is another phenomenon that behaves much like coreference, but which
we might call pseudo-coreference. Consider the example:

Anna benützte das grosse Wörterbuch und ich benützte das kleine.
Anna used the big dictionary and I used the small one.

The noun group underlined in the German, "das kleine," is elliptic for "das
kleine Wörterbuch" ("the small dictionary"), and we can supply the main noun
by looking at a noun group earlier in the sentence, "das grosse Wörterbuch"
("the big dictionary"). We might think of the abbreviated noun group as
making a back reference to "das grosse Wörterbuch," except that it refers to
the general class "Wörterbuch" ("dictionary") rather than to the more
restricted class "large dictionary." In this case, we are not really dealing
with coreference, but with an abbreviated way of distinguishing two separate
but related objects. We represent the two with different variables, but also
note that the abbreviation has been used by filling the register PARALLELS for
both noun groups. This register is also used to represent parallelism
exhibited by conjunctions. (In our example sentence above, the RSS's for the
two conjoined clauses would also be marked parallel.) PARALLELS is also used
for parallel main clauses, either connected by semi-colon or in separate
sentences. The basic idea is that this parallelism information is not really
language independent, since many languages might have different rules for
conjunctions or noun group pseudo-coreference. However, the use of the
PARALLELS register saves time spent in making comparisons.


4.9 Thematic Systems

Thematic systems represent the organization of an utterance as a message. As
such, they are not restricted to the discourse level (the level of linguistic
organization above the sentence). In fact, thematic systems can be found at
the word, group, and clause levels of a systemic grammar. The distinctions in
this section are adapted from Halliday's work, hopefully with the original
intentions intact. Readers wishing to judge for themselves are referred to
Halliday.


4.9.1 What's in a Semantic Representation - Revisited

Before considering thematic phenomena in detail, let us return to the
question "What should a semantic representation represent?" We can divide the
information in a semantic node (see section 4.3) into three classes, one of
which will be thematic.

First, the semantic representation contains information from the
propositional level - more or less. Strictly speaking, I would consider the
propositional level to be raw knowledge - semantics minus the thematic
systems. But, as we will see below, the object-relation-property distinction
can be called thematic. To call the propositional level semantics minus
thematic information, then, means that it has an extremely low level of
organization. Let me therefore qualify the original statement and say that
the semantic representation contains information from the propositional level,
augmented by the object-relation-property distinction. In this category, we
place the registers CONCEPT, CASE, LINKAGE, VARIABLE, and REFERENCE-SCOPE.

A second kind of information in the semantic representation relates the
surface structure of an utterance to this semi-propositional level. For this,
we have the registers TYPE, ORDER, RESTRICTIONS, OSSNODE, RSSNODE, PSSNODE,
and PARSENODE. This is information that is useful in deciding whether a
representation is appropriate, deciding between different semantic
representations, and keeping track of the way the semantic representation
corresponds to the syntactic structure that is being built by the parser.

The rest of the information in the semantic representation is thematic.
This includes CONNOTATIONS, GIVEN-NEW, COREF, PARALLELS, INFO-ORDER,
CLAUSE-TYPE, MODIFIERS, THEME, and RESTRICT-DESCRIBE. The rest of this
section will be devoted to these thematic categories.


4.9.2 Thematic Organization Below the Clause Level

Most thematic information seems to be related to the clause - the group
level thematic features that will be discussed here are also found at the
clause level. One system discussed in this section - information focus - is a
discourse system, and should properly be represented above the sentence level.
Since, however, its manifestations are seen at the group and clause levels, it
will be represented at the relevant group and clause semantic nodes.

A thematic phenomenon at the word level is connotation, related as it is
to the speaker's choice between different ways of expressing the same concept.

Looking next at the group level, the head-modifiers linkage seems to be
thematic. Consider, for example, the difference between the two noun groups:

(i) der blaue Himmel / the blue sky

(ii) die Bläue des Himmels / the blue of the sky

In the semantic representation, the difference between the two lies in the
MODIFIERS register.

Another group level thematic feature is the distinction between objects,
relations and properties. Objects and relations can both be seen as bundles
of properties. In defining them, we are making a commitment to a coherent
world view, i.e. to some sort of "identity" in the case of objects and to the
assumption of "relatedness" instead of randomness in the case of relations.
This conceptual leap of faith does not seem to be a conscious choice on the
part of an individual speaker, but rather a choice that is built into the
language. The reason I say language here instead of conceptual structure is
that it is possible, for example, that the concept of "objecthood" differs
from language to language, as Whorf contends in his analysis of Hopi (36).
Nevertheless, for the translating system, the object, relation and property
distinction is expected to be maintained in the deductive data base. This is
because the distinction seems to be integral to both English and German. If
other languages do distinguish objects, relations and properties in different
ways, and if this reflects conceptual differences as well, then the conceptual
representation chosen here is heavily language dependent in this respect.


Finally, the discourse categories "given" and "new," which belong to what
Halliday calls the information system, also appear as group level thematic
features. Since this system is realized primarily phonologically, only
certain aspects of information organization are going to be relevant. GIVEN
information is that which the speaker (writer) thinks the listener (reader)
can deduce, either because it has been stated explicitly, or because it is
common knowledge, or because it is unambiguous, hence unique, etc. NEW
information, on the other hand, is the reason for the writer's sentence, i.e.
the information which he wishes to communicate to the reader.

One place where the given-new distinction is reflected in text is in
definite and indefinite determiners. The difference between the noun groups
underlined in

(i) Das Buch ist verschwunden. / The book has disappeared.

(ii) Ein Buch ist verschwunden. / A book has disappeared.

is that in (i), the reader is expected to know, or very soon find out, which
book is meant, while in (ii), he is not. All semantic nodes corresponding to
noun groups, then, are marked by the translating system with either GIVEN or
NEW. Note that this means that some relations and properties are also marked
along the way, i.e. those that are expressed as noun groups: "das Schwimmen"
/ "swimming", "die Bläue" / "the blueness." Pronouns, of course, are
automatically GIVEN, since their referent is always expected to be derivable,
either when they appear (for back reference) or as soon as more of the
sentence has been processed (for forward reference).
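
The marking rule just stated is simple enough to sketch. The Python below is
an illustration only, not the system's code: the function given_new, its
arguments, and the two determiner tables are assumptions, and the tables list
only the article forms needed for the example. Noun groups with other
determiners, or with none, are left undecided here.

```python
# Article forms assumed for the example; not a full inventory of determiners.
DEFINITE = {"der", "die", "das", "dem", "den", "des"}
INDEFINITE = {"ein", "eine", "einem", "einen", "einer", "eines"}


def given_new(determiner, is_pronoun=False):
    """Return "GIVEN" or "NEW" for a noun group, or None when this
    sketch has no rule for the determiner."""
    if is_pronoun:
        return "GIVEN"  # the referent of a pronoun is expected to be derivable
    word = determiner.lower()
    if word in DEFINITE:
        return "GIVEN"
    if word in INDEFINITE:
        return "NEW"
    return None


book_definite = given_new("Das")     # "Das Buch ist verschwunden."
book_indefinite = given_new("Ein")   # "Ein Buch ist verschwunden."
pronoun = given_new("sie", is_pronoun=True)
```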

Another place that the information focus system is reflected in text
(although it is not now handled) is shown in the following examples:

(i) der blaue Klotz / the blue block

(ii) der Klotz, der blau ist / the block that is blue

One important difference between these two is that the entire noun group in
the first example must be GIVEN, while in the second, the subordinate clause
allows "blue" to be NEW, even though the rest of the noun group is GIVEN.

At the word level, then, there is connotation, and at the group level
there are the distinctions between head and modifiers and between objects,
relations, and properties. These two group level distinctions also appear at
the clause level. Finally, while the information focus system is properly
discourse level, the given-new distinction is represented both at the group
and at the clause level. Note that two other registers in the semantic
representation related to information focus are COREF and PARALLELS, which
were discussed in section 4.8 above.

4.9.3 Thematic Organization at the Clause Level 

An important clause level thematic system is the theme-rheme distinction.
In terms of Halliday's definition, the theme is the first constituent of the
clause. In the translating system, therefore, the theme is marked by the
syntactic component when the clause is parsed, but it is also given a semantic
representation. In semantics, the THEME register on the clause RSS is set to
the semantic node associated with the theme. Halliday has characterized the
theme as "the peg on which the message is hung" (p. 161). The rheme is the
rest of the clause. In terms of the information structure, the theme is often
GIVEN. This is not always the case, however, as shown by this sentence from
the sample paragraph:

Nach von Hess sollen sie sich wie der farbenblinde Mensch verhalten.
According to von Hess, their (the cephalopods') behavior is like that of a
color-blind man.
From the reader's viewpoint, theme acts as a set of directions for
interpreting the information in the sentence. When the theme is GIVEN, the
writer is saying: "Here is a concept with which you are familiar, on which you
can hang the information I am going to give you." On the other hand, when
theme is NEW, the writer is setting the scene, giving information he considers
helpful or essential to interpreting what will be said in the rest of the
sentence. In the example given, it is important for the author to qualify his
statement by attributing it to von Hess. In essence, he is saying, "Don't
assume I believe what I am going to tell you." In the example sentence, this
qualification is also expressed textually by the use of the verb sollen
("supposed to be"). In fact, the author goes on to give evidence that some
squids do perceive color.

Another thematic mechanism that relates to what the reader is expected to
do with the information he is given is what I will call the restrict-describe
distinction. Basically, a head-modifier type relation is RESTRICT if the
modifier is expected to give the reader useful or essential help in
identifying the head. DESCRIBE information may also be useful, but it is
treated by the writer as supplementary information. A relation has the
attribute RESTRICT when modifiers are used to limit the reference of the head.
"Der rote Pulli" ("the red sweater"), for example, exhibits this kind of
relation, since not all sweaters are red, and the adjective has been used as a
distinguisher. "Die rote Feuerspritze" ("the red fire engine"), on the other
hand, would probably be a DESCRIBE relation. This is because red is generally
a property of fire engines, and so the modifier has been used purely
descriptively, rather than as an attempt to single out a particular fire
engine. This distinction is reflected syntactically in the English
restrictive and non-restrictive clauses:

(a) Cephalopods that live in coastal areas build their houses out of stones.

(b) Cephalopods, which live in coastal areas, build their houses out of
stones.

In (a), the clause is used as a distinguisher, while in (b) it gives
supplementary descriptive information. Note that English requires commas for
(b) but not for (a), and the distinction is often emphasized by contrasting
that and which. In German, on the other hand, the translation for both (a)
and (b) would be:

Die Kraken, die in der Küstenzone leben, bauen ihre Wohnhöhle aus Steinen.

or better -

Die in der Küstenzone lebenden Kraken bauen ihre Wohnhöhle aus Steinen.

Since both types of clauses have the same surface realization, semantic
knowledge must be used to make the distinction. Here we have a situation
where semantic interpretation is necessary for German to English translation,
since if we are to choose the correct English representation for such a
clause, we must interpret the German semantically.

Another clause level feature is the register INFO-ORDER. If one had to
classify this register, it would be assigned to the information focus system,
although the INFO-ORDER register itself is a very ad hoc measure. The
semantic component checks the semantic cases of adverbials in the clause and
marks INFO-ORDER accordingly. If the adverbial ordering is the default one,
the register is set to UNMARKED. If the ordering is not the default one, then
the register is set to a list of the RSS's of the adverbials, in the order of
their appearance in the sentence. Presumably, those nearest to the end of the
clause are considered most important by the speaker (unless there is some
other reason for the ordering, like abundant modifiers), and this might be
useful information to preserve for the generator.
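
The INFO-ORDER setting can be sketched as a comparison against a default
ordering of cases. The Python below is illustrative only: the function
info_order, the RSS names, and in particular the default order "time, manner,
location" are assumptions, since the text does not list the default.

```python
# Assumed default ordering of adverbial cases in a German clause.
DEFAULT_ORDER = ["time", "manner", "location"]


def info_order(adverbials, default=DEFAULT_ORDER):
    """adverbials is a list of (rss_name, case) pairs in order of
    appearance. Every case must occur in `default`.

    Returns "UNMARKED" if the cases appear in the default order,
    otherwise the list of RSS names in order of appearance.
    """
    cases = [case for _, case in adverbials]
    expected = sorted(cases, key=default.index)  # stable sort by default rank
    if cases == expected:
        return "UNMARKED"
    return [rss for rss, _ in adverbials]


# "gestern (time) in London (location)": the assumed default order.
unmarked = info_order([("RSS1", "time"), ("RSS2", "location")])

# "in London (location) gestern (time)": a marked order.
marked = info_order([("RSS2", "location"), ("RSS1", "time")])
```

Keeping the RSS list only in the marked case mirrors the description above:
the generator needs the actual order only when it departs from the default.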

One last thematic category used at the clause level is CLAUSE-TYPE, which
is a register set on the RSS corresponding to the clause. This register may
have the values COMMAND, QUESTION, STATEMENT or SECONDARY. It is assumed
that this information will be used by the generator and also by the deductive
component. We would expect the deductive component to know something about
the implications of the type of clause used, i.e. the presuppositions and
expectations associated with it. For connected text, this sort of information
would primarily be useful for disambiguation, i.e. to indicate the ways the
information in a clause could be used in deduction. For more interactive uses
of language, the expectations associated with CLAUSE-TYPE would indicate the
type of action that must be performed, e.g. carrying out a task, finding an
answer, etc.


4.9.4 Discourse Semantic Structures

Originally, I planned a separate discourse semantic structure (DSS) which
was to be associated with each sentence and carry information about
intersentential relationships. Except for the information system treated
above, however, very little of this seems to be derivable from the surface
structure of a text. I will defer a discussion of discourse level
structuring, then, to a later section.


4.10 The Place for Case

A case grammar in the style of Fillmore uses semantic case information
for several purposes. For some of these functions, I have used other
mechanisms in the system. When particular prepositions are required by the
verb, for example, the collocations list is used (section 4.5 above). For
objects of the verb in general, I have ignored case entirely, assuming that
this information would be used at the deductive level. Winograd's system has
the case-like global variable SMLOC (location), which is bound to a location
required by the verb, as in "Put the book on the table." My system, in
contrast, uses only SMONE, SMTWO, and SMTHREE to specify participants in a
relation. This was done because selection restrictions filled the role of
case here; SMLOC and the other variables that one could add, like time and
manner, were redundant.
Semantic case does, however, have a place in the translating system,
although in giving it one I have extended the meaning of the term. For the
rest of the section, let us consider modifying relations exclusively. In the
system, selection restrictions specify the constraints a modifier places on
its head. In addition, we also need a way to express the types of modifiers
that a head can take. For example, we might want to specify that events can
be modified by location, time and manner, or that physical objects may have
shape and color. These constraints are not now implemented, since I expect
them to be embedded in the deductive routines. What is implemented is the
characterization of the individual modifying relations by high level
categories which I will call semantic cases: orientation, location, etc.
‘Unite the constraints on nodi flere have Det. .bgen_| ivelgnentes. eonantic 
case information does have, other uses. In. the sustons mc. for order ing 
modi fying relations are expressed. Ip terns. of. Fane. ad hat. ig," “gis greue. grosse 
Tintenfiech" ("the grey big. squid") Sounds strange .49.b9th. Garvan. and. FEnalish 
because the rule "size before color" has been violated. (Note that we may 
need more generality than the simpie case categories to express ail the 
order ing rules, but at least case goes a long way toward expressing the more 
common regularities that occur.) Cows ateo helps to explain verb modi fier 
ordering. In German we would be more likely to say “Wie traffen uns gestern 
(time) in London (location) ," while in English the more frequent arrangement 
would be: "We met in London (location) yesterday (timed ." 1 should add that 
right now the German end of the eystem does nothing more with case then find 


it. The semantic component does not care whether it sets "die grosese graue 
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Tintenfisch" or a “graue grosee” one. These rules, of couree, could be added 
easily enough. Semantic case information doss have # place in ‘generation, 
however, since modifier order bg rutee in the target ‘Venguage are ‘expressed in 
terms of case. Caen 
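
As a rough illustration of how ordering rules might be stated in terms of
case, the following Python sketch sorts modifiers by a precedence table. The
table contents, the function name order_modifiers, and the (word, case) pair
layout are assumptions made for this example only; they are not taken from
the system described here.

```python
# Hypothetical precedence tables: a lower number means "comes earlier".
ADJECTIVE_ORDER = {"size": 0, "color": 1}       # the rule "size before color"
VERB_MODIFIER_ORDER = {
    "german": {"time": 0, "location": 1},       # gestern ... in London
    "english": {"location": 0, "time": 1},      # in London ... yesterday
}


def order_modifiers(modifiers, precedence):
    """Return the modifiers sorted by the precedence of their semantic case.

    Each modifier is a (word, case) pair. Python's sorted() is stable, so
    modifiers that share a case keep their original relative order.
    """
    return sorted(modifiers, key=lambda modifier: precedence[modifier[1]])


adjectives = [("grey", "color"), ("big", "size")]
ordered_adjectives = [w for w, _ in order_modifiers(adjectives, ADJECTIVE_ORDER)]

met = [("yesterday", "time"), ("in London", "location")]
ordered_english = [
    w for w, _ in order_modifiers(met, VERB_MODIFIER_ORDER["english"])
]
```

A generator could consult one such table per target language, which is why
the case information has to survive into the semantic representation.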

The mechanism for retrieving semantic case information is a simple one.
Since I am assuming that different concept markers always imply different
cases, the logical place for case information is the selection restriction
tree. To build the semantic representation for a relation concept marker that
acts as a modifier (including, of course, #HAVE-PROPERTY), we trace up the
selection restriction tree until a marker is found with its CASE property set.
Semantic cases are associated with subtrees of the selection restriction tree,
although there may be several cases along a branch. This allows us to handle
exceptions, since the first case found is the one used.
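
The lookup just described can be sketched in a few lines of Python. This is
only an illustration under stated assumptions: the Marker class, the find_case
function, and the marker names in the small tree are invented here, and the
actual system is not written in Python.

```python
class Marker:
    """A node in a (hypothetical) selection restriction tree."""

    def __init__(self, name, parent=None, case=None):
        self.name = name
        self.parent = parent
        self.case = case  # stands in for the CASE property; None if unset


def find_case(marker):
    """Trace up from marker toward the root; return the first CASE found."""
    node = marker
    while node is not None:
        if node.case is not None:
            return node.case
        node = node.parent
    return None  # no marker on this branch carries a case


# Illustrative tree; these marker names are assumptions for the sketch.
relation = Marker("#RELATION")
spatial = Marker("#SPATIAL", parent=relation, case="location")
inside = Marker("#IN", parent=spatial)                            # inherits
toward = Marker("#TOWARD", parent=spatial, case="orientation")    # exception
```

Because the search stops at the first marker with a case, a marker lower on a
branch overrides the case of the subtree it sits in, which is how the
exceptions mentioned above would be handled.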

In this chapter we have discussed the semantic representation and the way
a set of representations are associated with individual sentences. The next
chapter discusses some of the issues that must be considered if we are to
choose a single interpretation for a given sentence.


5.1 Introduction

The process of understanding a sentence may be thought of as the process
of relating it to an internal knowledge structure. As has already been
mentioned, the translating system implemented makes no gestures toward
understanding, but instead by-passes the problem entirely (with the possible
exception of selection restrictions). This omission arises not out of the
belief that understanding is unimportant to translation, but rather out of the
conviction that a fragmentary solution is no solution at all. While the
understanding component for the system remains a "black box," the mechanism
needed to fill this gap is not as ill-defined as it once was. Recent work by
Minsky (27), Charniak, Goldstein (9), McDermott (28), and Sussman (34) is
extremely exciting, and constitutes real progress toward a theory of
representing and structuring knowledge.

The chapter that follows relies heavily on the ideas in the references
cited above, but I will be considering much more restricted questions. First,
given the system that is described here, what sorts of interactions would we
expect between a knowledge structure and the rest of the system? Second, I
will take a short look at the sorts of special problems that come up in text
and some of the cues we can take advantage of. In what follows, I will refer
to our "black box" as the deductive component, although this term is
misleading. Deduction is probably an important part of understanding, but not
necessarily the primary mechanism. I use the word "deduction" instead of
"understanding," however, since the interactions outlined between the
component and the system here might not be identical to the interactions
between a more general "understander" and a system's linguistic components.
In what follows, I will assume that the deductive component is written in a
language with at least the representational power of Conniver.


5.2 The Basic Functions of the Deductive Component

Understanding plays a crucial role at the interpretive end of the
translating process: we need to understand in order to decide which sense of
a word is intended, to untangle pronoun references, and so on. We would
therefore expect strong interaction between the deductive component and the
parsing and semantic components. As will be discussed in the next chapter, we
might also need to draw on our general knowledge structure for generation - in
particular, when paraphrase is necessary. This seems to be a more specialized
mechanism, but I am not prepared to discuss it further, so it will not be
considered here. This leaves, then, the interpretive role of the deductive
component, which can be divided into two functions: disambiguation and
supplying information that is implicit, but not explicit, in text. In terms
of the system here, these tasks can be reformulated as choosing between
possible semantic representations and filling in the slots left open in them
(the #UNSPEC marker). These processes are not independent, but rather
intimately interrelated. Clearly, a choice between representations is made
easier when all the information is in. On the other hand, we can make a
decision about implicit information only when we have committed ourselves (at
least temporarily) to a particular context. The situation is not as
hopelessly circular as my presentation of it; I merely wish to emphasize that
implicit information can have its uses in disambiguation, and disambiguation,
in turn, will supply implicit information.

We cannot really ask how the deductive component will interact with the
rest of the system before we ask when it will do so. Ideally, of course, the
understanding section of the system would be active, directing the parse.
Since, however, text interpretation is still at the stage where syntactic
information is usually the best information available, the question is really
when the semantic component should call deduction. A likely place to make the
call is at the end of each semantic specialist, so that bad representations
can be eliminated right away and do not have to be carried forward. In
addition, if all representations are rejected by deduction, we have some good
information to send the parser. The sooner deduction realizes the error, the
less complicated backup will be, since we are still at or near the scene of
the difficulty.
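
The control flow suggested above might look roughly like the following
Python sketch. It is not the implementation: run_specialist, parse_step, and
the callable deduce_ok (a stand-in for the deductive component) are
hypothetical names chosen for the illustration.

```python
def run_specialist(build_candidates, deduce_ok):
    """Build candidate representations, then let deduction filter them.

    build_candidates: callable returning a list of candidate representations.
    deduce_ok: stand-in for the deductive component; returns True if a
               representation is acceptable.
    Returns the surviving candidates; an empty list means all were rejected.
    """
    return [rep for rep in build_candidates() if deduce_ok(rep)]


def parse_step(specialists, deduce_ok):
    """Run the specialists in order, backing up at the first total rejection.

    specialists: list of (name, build_candidates) pairs.
    Returns ("ok", None) or ("backup", name_of_failing_specialist), so the
    parser learns where the difficulty arose while it is still nearby.
    """
    for name, build_candidates in specialists:
        if not run_specialist(build_candidates, deduce_ok):
            return ("backup", name)
    return ("ok", None)
```

Calling deduction once per specialist, rather than once per sentence, is
what keeps bad candidates from being carried forward.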

Understanding is not, however, a single monolithic process. Some
linguistic structures require a delay in parts of the process. For example,
where in English we would say, "Give me the red pencil and the blue one," the
German could be, "Gib mir den roten und den blauen Bleistift" (literally,
"Give me the red and the blue pencil"). Here, the first noun group in the
conjoined structure cannot be fully evaluated until the second has been parsed
and evaluated. Similarly, a German end-order clause construct requires that a
good part of the processing of the clause must wait for the main verb to be
parsed. Constraint number one on our deductive component, then, is that it
cannot be an all-or-nothing process. Information should be usable as it is
accumulated. The deductive component might be able to reject some possible
semantic representations for a given noun group before the rest of the
sentence has been evaluated, but a more likely function would be to reshuffle
the priorities of different possibilities. This updating process will be
discussed further in section 5.4. Related to the use of partial information,
we would also want to activate information not just for full grammatical units
(e.g. noun group, clause, etc.), but also at certain intermediate points, for
example, as soon as a verb is found. Thus, the implementation of the
deductive component outlined here would also require some changes in the
semantic specialists currently in the system.
5.3 Relating a Sentence to the Knowledge Structure

It is assumed that the deductive component will be activated by an
approach similar to that used in Winograd's system. Each possible semantic
representation of a sentence is converted into a set of assertions for the
deductive data base, and procedures are automatically built from these
assertions to perform the necessary deductive processing. Conversion of the
semantic representation to an assertion set is straightforward. The variable
in an OSS is combined with the concept marker and the relation #IS, for
example, (#IS X1 #OCTOPUS). Each such tuple is given an assertion name as
well, and an assertion name can appear as an argument in other tuples. For
RSS's, the relation concept markers are combined with the variables of their
arguments (if these are OSS's) or the assertion names of the relations formed
from arguments (if these are RSS's). Finally, modifying relations are given
assertions based on concept markers and assertions based on case. Figure 5.1
is an example of the assertions that could be produced for a modifying
relation. For the special case of the modifying relation #HAVE-PROPERTY we
build only one assertion, namely one made up of the case of the property, the
variable or assertion name of the head, and the property's concept marker.
Figure 5.2 gives an example of this. These assertions do not represent all
the information in the semantic representation, and it is expected that the
procedures will use thematic information as they go about turning the
assertion set into routines to be used by the deductive component. Just how
this will be done, however, seems to depend very much on the way general
knowledge is to be structured, so I will not consider this question further
here.

        A1: (#IS X1 #OCTOPUS)
        A2: (#IS X2 #TANK)
        A3: (#IN X1 X2)
        A4: (#LOCATION X1 A3)

        Figure 5.1

        A5: (#IS X3 #ELEDONE)
        A6: (#COLOR X3 #GRAYISH-YELLOW)

        Figure 5.2
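
To make the conversion concrete, here is a small Python sketch that
produces named tuples in the style of Figures 5.1 and 5.2. The class
AssertionSet and the three helper functions are hypothetical, and the idea of
forming the case assertion by prefixing "#" to the upper-cased case name is an
assumption of the sketch, not something the text specifies.

```python
from itertools import count


class AssertionSet:
    """Collects tuples and gives each one an assertion name (A1, A2, ...)."""

    def __init__(self):
        self._ids = count(1)
        self.assertions = {}  # assertion name -> tuple

    def add(self, *items):
        name = f"A{next(self._ids)}"
        self.assertions[name] = items
        return name


def assert_object(aset, variable, marker):
    """An object structure yields (#IS variable marker)."""
    return aset.add("#IS", variable, marker)


def assert_modifier(aset, marker, head, argument, case):
    """A modifying relation yields a marker tuple plus a case tuple."""
    relation_name = aset.add(marker, head, argument)
    aset.add("#" + case.upper(), head, relation_name)
    return relation_name


def assert_property(aset, case, head, marker):
    """#HAVE-PROPERTY yields only one tuple: (case head property-marker)."""
    return aset.add("#" + case.upper(), head, marker)


aset = AssertionSet()
assert_object(aset, "X1", "#OCTOPUS")
assert_object(aset, "X2", "#TANK")
assert_modifier(aset, "#IN", "X1", "X2", "location")
```

Note that the case tuple takes the assertion name of the relation as its
argument, which is the mechanism by which one tuple can refer to another.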


When tuples like those given above are asserted in the deductive data
base, it is expected that they will trigger all or some part of the related
information that is stored as permanent knowledge. In the simple hierarchic
model from Chapter 3, this would involve a chain reaction up through the
hierarchies, so that (#IS X1 #OCTOPUS) would trigger the assertions (#IS X1
#WATER-ANIMAL), (#IS X1 #INVERTEBRATE), (#IS X1 #ANIMAL), etc., along with the
related information in each case. Most assertions will be more complicated
than these #IS tuples, and we will want to be able to record information about
the status of an assertion, e.g. under what conditions an assertion can be
expected to hold. Certain minimal distinctions are essential. First, we
would want to distinguish between a relation that we know does not hold and
one about which we simply have no information. In addition, we want to
distinguish between a relation that is unasserted but could hold for a
situation, and one that is not only unasserted, but also irrelevant. For
example, we can ask "Where did Harvey put the book?" but not, "Where did
Harvey put the rapidity?" Location, of course, is relevant for concrete
objects but undefined for abstract ones. Furthermore, we would also want to
qualify assertions in the "yes" and "no" states, by the sources of the
information, the times the assertions are in this state (sometimes, often,
Mondays only), etc. The minimal number of different assertion states, then,
is four: "yes" (it holds), "no" (it does not hold), and two varieties of
"unasserted," either "defined" or "undefined." In general, it is probably a
non-trivial task to determine whether or not a relation would make sense if it
were asserted, but certain broad distinctions can be made.
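
The chain reaction and the four assertion states can be illustrated with a
toy data base in Python. Everything specific here is an assumption made for
the sketch: the PARENTS and DEFINED_FOR tables, the markers #CONCRETE-OBJECT
and #ABSTRACT, and the DataBase class itself.

```python
from enum import Enum


class State(Enum):
    YES = "yes"
    NO = "no"
    UNASSERTED_DEFINED = "unasserted, defined"
    UNASSERTED_UNDEFINED = "unasserted, undefined"


# Hypothetical fragment of permanent knowledge.
PARENTS = {
    "#OCTOPUS": ["#WATER-ANIMAL", "#INVERTEBRATE"],
    "#WATER-ANIMAL": ["#ANIMAL"],
    "#INVERTEBRATE": ["#ANIMAL"],
    "#ANIMAL": ["#CONCRETE-OBJECT"],
    "#RAPIDITY": ["#ABSTRACT"],
}
DEFINED_FOR = {"#LOCATION": "#CONCRETE-OBJECT"}  # relation -> required marker


class DataBase:
    def __init__(self):
        self.facts = {}  # (relation, variable, value) -> True or False
        self.types = {}  # variable -> set of markers known to apply

    def assert_is(self, variable, marker):
        """Assert (#IS variable marker) and propagate up the hierarchies."""
        known = self.types.setdefault(variable, set())
        pending = [marker]
        while pending:
            current = pending.pop()
            if current not in known:
                known.add(current)
                pending.extend(PARENTS.get(current, []))

    def assert_fact(self, relation, variable, value, holds=True):
        self.facts[(relation, variable, value)] = holds

    def state(self, relation, variable, value):
        key = (relation, variable, value)
        if key in self.facts:
            return State.YES if self.facts[key] else State.NO
        required = DEFINED_FOR.get(relation)
        if required is not None and required in self.types.get(variable, set()):
            return State.UNASSERTED_DEFINED
        return State.UNASSERTED_UNDEFINED
```

A fuller version would also attach the qualifications mentioned above
(source, times) to the "yes" and "no" entries; the sketch omits them.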


5.4 Positive Selection Restrictions 
The use of a verb raises certain expectations about the nature of the
participants involved. Similarly, attributive adjectives raise expectations
about the noun modified, and prepositions carry constraints on their objects.
The selection restrictions introduced in chapter 3 allowed us to express
minimal conditions on participants in a relation, but there is more
information that is potentially useful. For example, if we know one of the
participants in a relation, we often have a much better idea of what the
others might be. Let us call the restrictions from chapter 3 negative
selection restrictions, and define a positive selection restriction to be a
block of information about the expected participants in a relation that can be
used by the deductive component to decide between possible interpretations of
an utterance. When we try to specify more closely what positive selection
restrictions should look like, we run into some issues which we did not have
to face when dealing with their comparatively simple negative counterparts.

As mentioned above, if all possible semantic representations for a phrase
are rejected, the parse itself will be rejected. Since we might expect
positive restrictions to be rather intimately related to the general knowledge
structure, we could find ourselves in a situation where an incorrect statement
by the author violates the restrictions and is sent back to be reparsed. This
is obviously undesirable; the deductive component should distinguish between
false statements and nonsensical ones, at least as much as possible. Note
that nonsense will be considered a misparsing here, since I am assuming that
when a nonsensical statement actually appears in text (as in children's
stories or a discussion of "colorless green ideas"), we will have been
adequately warned by context. An example of the potential confusion of
nonsense with misstatements will probably be helpful here.

        Eingehende Untersuchungen über einen etwaigen Farbensinn der
        Cephalopoden sind sehr erwünscht.

        Thorough investigations on the existence of a sense of color in
        cephalopods would be very desirable.

Let us assume that erwünscht is defined by the concept marker #WISH-FOR (We
might want something more precise, but this will do.), and that the first
argument here is the generalized #PERSON modified by #ANY. To specify the
second object, we would give it the negative restriction #RELATION, and treat
the situation where an object appears in this place as the sort of ellipsis
discussed in section 4.7.3. What else can be said about relations that are
wished for? One important factor is that the relation does not now hold, or
at least the speaker believes that it does not now hold. If, in fact,
conclusive studies of cephalopod color perception did exist at the time this
article was written, the reader must assume that the author did not know about
them. Our knowledge of these hypothetical studies would not, however, block
the interpretation of the sentence altogether.

It is harder to envision some positive restrictions becoming involved in
misstatements than others. Our "wish-for" example is probably a fairly common
candidate for misstatement. On the other hand, the exchange in (ii) is quite
a bit less likely to occur than the one in (i):

        (i) Harry is a bachelor. -- No, he got married Saturday.

        (ii) Jane is a bachelor. -- No, she's a female.

A mistake about marital status is presumably easier to make than one about the
sex of an individual.

For positive selection restrictions, then, our knowledge structure should
provide a mechanism to evaluate how likely a speaker is to make a mistake.
This mechanism could take the form of a value, function, or procedure
associated with each restriction to calculate the probability that the
restriction will be violated in the given context. We might not need
completely explicit information here about the likelihood of violation, since
some of it might be deducible from the knowledge structure: if knowledge in
the system is arranged so that certain facts are much less prone to mistakes
than others, then the system could assume that the writer is also much less
likely to make misstatements of this variety. Of course, this would only
serve as a take-off point, since particular context (e.g., whether a child is
writing) or other misstatements from the same source might alter our
expectations about the likelihood of particular errors.

Interested readers are referred to McDermott (28), which treats questions
of belief and doubt in the treatment of information.
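
One very simple reading of such a mechanism is sketched below in Python.
The probabilities, the threshold, and the names plausibility and classify are
all invented for the illustration; the text does not commit to numbers or to
multiplying probabilities, and a real mechanism would also take context into
account.

```python
# Hypothetical estimates of how likely a writer is to get each kind of
# fact wrong. These numbers are made up for the sketch.
MISSTATEMENT_PROB = {"marital-status": 0.2, "sex": 0.001}


def plausibility(violated, misstatement_prob):
    """Estimate how likely the violations are to be honest mistakes.

    violated: names of the positive restrictions an interpretation violates.
    Restrictions with no entry are treated as never misstated (probability 0).
    """
    score = 1.0
    for name in violated:
        score *= misstatement_prob.get(name, 0.0)
    return score


def classify(violated, misstatement_prob, threshold=0.01):
    """Separate probable misstatements from probable misparses."""
    if not violated:
        return "consistent"
    if plausibility(violated, misstatement_prob) >= threshold:
        return "probable misstatement"   # keep the parse; the author erred
    return "probable misparse"           # treat as nonsense and reparse
```

With these made-up values, the "Harry" exchange would be accepted as a
misstatement while the "Jane" exchange would be sent back for reparsing.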


5.5 Filling in the Blanks: Dealing with #UNSPEC

We have said that in addition to deciding between possible semantic
representations, the deductive component should be able to supply information
that has been left implicit in the text. In the implementation, it has been
assumed that such implicit information comes in two varieties: information
that will be necessary in generation and information that will not.
Therefore, a lot of implicit information will not be reflected in the semantic
representation at all. Consider the examples:

        Merkur flog nach Athen. / Mercury flew to Athens.

        Lindbergh flog nach Paris. / Lindbergh flew to Paris.

Every object has an implicit time setting and duration. If the Mercury here
is the historical one, we would have good evidence for assuming that he made
the trip under his own power, since the airplane had not yet been invented.
On the other hand, since Lindbergh's flight postdated the invention of the
airplane, it would be highly unlikely that the flying here is done in any
other way than in an airplane. (Knowing that Mercury had wings on his feet
and that Lindbergh made the first U.S. to Paris nonstop solo flight would be
even better information, of course, but that is not the point here.) Implicit
noun tense, then, seems to be useful information for disambiguation.


Since implicit noun tense seems useful, should it also have a place in
the semantic representation? Chafe's group at the University of California at
Berkeley has identified the question of implicit [...] as an [...] one for
translation. In the [...] I have not encountered any [...] in which the tense
of objects must be explicitly represented in the semantic representation.
(There may, of course, be "emergency situations" where the generator cannot
find a well-formed translation and might need to make additional calls to the
deductive component, but right now I am considering [...] in which the [...]
which must [...].) [...] assumption of the predictability of implicit
information is correct, [...] the decision [...] language dependent, that is,
dependent on the two languages involved. [...] closely related than German
[...] a semantic [...] for its representation-building behavior.
We can say, then, that the deductive component will be supplying implicit
information both for [...] and to [...] the semantic representation for the
generator. Let us concentrate on the information that will clearly be
necessary for generation and see how it can be supplied. In section 4.7.2 we
discussed the use of #UNSPEC for things like times, locations, agents, etc.
With a mechanism such as that proposed by Minsky (27), this sort of
information would be supplied by a rich default structure. Of course, a
default need not be completely specified by the internal structure. For
example, we might choose a general default for the location of an object to
be, "If there is no reason to think otherwise and the location of the object
has been specified previously, assume it is still in the same place." It
seems fair to assume that defaults will usually be heavily context dependent.
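
The location default quoted above can be written down directly. The Python
sketch below is only an illustration: the dictionary-shaped representation,
the context list of (object, location) pairs, and the function names are
assumptions, and "reason to think otherwise" is reduced to a set of objects
that some other part of the system has flagged.

```python
UNSPEC = "#UNSPEC"


def default_location(obj, context, doubted=()):
    """Apply the default: assume an object is where it was last placed.

    context: (object, location) pairs in the order they appeared in the text.
    doubted: objects for which there is reason to think the location changed.
    Returns the most recent known location, or UNSPEC if the default
    does not apply.
    """
    if obj in doubted:
        return UNSPEC
    for seen_obj, location in reversed(context):
        if seen_obj == obj:
            return location
    return UNSPEC


def fill_unspec(representation, context, doubted=()):
    """Return a copy of the representation with an open location slot filled."""
    filled = dict(representation)
    if filled.get("location") == UNSPEC:
        filled["location"] = default_location(filled["object"], context, doubted)
    return filled


context = [("octopus", "tank"), ("book", "table"), ("octopus", "bucket")]
filled = fill_unspec({"object": "octopus", "location": UNSPEC}, context)
```

The slot stays #UNSPEC when the default cannot be applied, so the generator
can still tell that nothing was supplied.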

In chapter 4 we suggested the use of #UNSPEC for elliptical situations
where a relation was implied by its arguments. There seem to be several
possibilities here. The first variety appears in our sample paragraph:

        Ob die Cephalopoden selbst auf Farben [...], ist noch [...]; so
        scheint das zum mindesten für diese [...]

        Whether the cephalopods even react to colors is not known; at least
        for these species, this appears to [...]

The full relation has been presented directly or can be immediately deduced
from the context, and the argument of the relation appears later as an
abbreviation. In the example, we would expect the deductive component to use
information about reacting to immediately reformulate the first sentence as a
question of whether cephalopods have a sense of color. Later, when we see the
reference to a color sense, we can check context and supply the implicit
relation(s):

        (#SUPPORT
            A10
            (#HAVE-FACULTY #CEPHALOPOD #COLOR-SENSE))

Here, A10 refers to another assertion. Thus, with good understanding of what
is happening in the text, this sort of ellipsis can be handled in a
straightforward manner.

For implied relations that are not immediately supplied by context, the
system will need a more general mechanism for finding a typical relation given
one or more of its arguments. We will see this mechanism used again for
compound nouns below. I suspect that these non-contextual implied relations
will be drawn from a rather restricted group, with relations like #EXISTENCE-
OF heading the list, but this speculation is not based on extensive analysis.
Note that some implied relations are derivable unambiguously from surface
structure (for example, the English "if possible" is equivalent to "if it is
possible"). We would expect to find more variety in semantic representations
for these cases, but they are [...]

[...] a group of compound nouns [...]. Consider the example:

        [...] door key [...] (here, öffnen, to open)

Here, we want to find a relation that is the function of the key, i.e. (#OPEN
#KEY #DOOR). There are a whole series of these "functional" [...] the
relation itself is unspecified. So once again, the deductive component will
have to supply a typical relation when given its arguments. Note that for
the #FUNCTION-OF case, we are not asking for a whole new mechanism, since we
would expect a knowledge structure to represent the link between an object and
its function.
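
As a minimal sketch of "supplying a typical relation when given its
arguments," the Python fragment below looks up the function of the head of a
compound. The FUNCTION_OF table and the name typical_relation are assumptions
for illustration; a real knowledge structure would presumably derive the link
rather than store it in a flat table.

```python
UNSPEC = "#UNSPEC"

# Hypothetical object-to-function links, as a knowledge structure might
# record them.
FUNCTION_OF = {"#KEY": "#OPEN"}


def typical_relation(head, modifier):
    """Propose a relation tuple linking the two components of a compound.

    If the head has a known function, use it as the relation; otherwise
    leave the relation unspecified for some later mechanism to fill.
    """
    relation = FUNCTION_OF.get(head, UNSPEC)
    return (relation, head, modifier)


door_key = typical_relation("#KEY", "#DOOR")
```

For the more associational compounds discussed next, the same interface
could return several candidate tuples instead of one.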


Compounds do exhibit relations besides #FUNCTION-OF that we might want to
represent by #UNSPEC. For example,

        der Handkoffer - hand luggage
        (luggage that can be carried by hand)

        der Kabinenkoffer - steamer trunk (literally, cabin trunk)
        (luggage that can be used in a ship's cabin)

For these and other more associational compounds, it is not clear that there
is only one relation that can describe the connection between the compound's
components. It seems desirable, however, for the deductive component to
supply a likely relation or relations to the semantic representation. Then,
if there is no equivalent compound or classifier plus noun combination in
English, the generator can use the semantic representation to produce a clause
or parenthesized explanation to describe the object.


5.6 Other Contributions of the Deductive Component

Except in dealing with the #UNSPEC marker, the deductive component is not
expected to alter the semantic representation. We can expect it, however, to
set registers on semantic nodes with information that will be useful for
generation. In particular, this information would include categories like
Halliday's known-unknown and variable-value (13). Final choices between the
categories restrictive-descriptive and generic-particular would also be made
by the deductive component, as well as decisions about coreference. As the
generator becomes more sophisticated, there will no doubt be other categories
that we would want to add to this list.


5.7 Deductive Processing above the Sentence Level

In order to decide between semantic representations, the deductive
component will have to construct a model of the text. Little information
above the sentence level is currently incorporated into the semantic
representation, since it is not yet clear how such information can be used in
text generation. Nevertheless, I think it is worthwhile to take a look at
some structural aspects of our sample paragraph. As we can see from the
English translation on page 15, the example paragraph can be divided into six
sections. These divisions can be characterized as follows:

        1. [...]: relating what is to come to what has come before
           in the text
        2. What it does: general description of the mechanism
        3. [...]
        4. What causes it: how to activate the mechanism
        5. What goes with it: accompanying actions
        6. Issue: can cephalopods perceive color?

In this analysis, sections 1-5 are clearly related, and one would expect them
to be handled using special knowledge of the way one would describe a
process or mechanism. The sixth section is not directly related to the
rest, and we would not expect this process knowledge to deal with it
directly. The issue of color perception is not, however, unrelated to the
other five sections. The basic reasoning here is, "Cephalopods are color
producers. Are they also color consumers (i.e. perceivers)?" The sixth
section, therefore, investigates the inverse of a key relation in the
paragraph. The associational link between section six and the rest of the
paragraph is a fairly common phenomenon. Similar associational links might be
used to introduce historical information or to make points that are too short
to merit a new paragraph of their own. From this analysis, we can conclude
that while paragraphs are generally organized around a single topic, we cannot
expect a strictly top-down, one-topic-per-paragraph organization. One good
heuristic seems to be to look for associational links near the end of a line
of description or reasoning (where sections 1-5 above would be considered
a line of description).

Any attempt to build a model of the sample paragraph would also have to
recreate the reasoning used. One of the patterns we see in the paragraph is:

        (i) Generalization: X is a cause of Y.

        (ii) Evidence: Example where X is blocked and change in Y is observed.

This sort of reasoning pattern would be associated with causality, and it
could be cued lexically by "veranlassenden" ("causing," line [...] of the
German text on page 13), "bei" ("upon," here a causal relation, line 11),
"abhängen von" ("depend on," line 13), and [...] ("effect," line 19). A
variant of this reasoning pattern - where evidence precedes the
generalization - might be slightly more difficult to handle. It is obviously
easier to interpret evidence if we know what it is evidence of. It seems
clear that at the paragraph level, too, we need to be able to handle partial
information as it is accumulated.

5.8 Interpreting Metaphor


Most types of text that would be considered for mechanical translation
probably would not contain phrases like "the babbling brook" or "the raging
storm," so one might conclude that the ability to handle metaphoric language
is not important in a practical system. This would be, I think, an
unfortunate conclusion. Metaphor is relevant, because it is part of the more
general problem of the creative use of language. In metaphors we take the
definitions of the individual words involved and suspend some of their
clauses, while transforming others slightly. Metaphor is a situation in which
a deductive component will have to reason by analogy, but it is not the only
one. Often, we see a phenomenon similar to metaphor in words. Consider,
for example, the word Erregungen from the first sentence of our paragraph. In
a technical sense, as it is used here, an Erregung is an electrical
excitation or impulse. In a [...] sense, it can mean agitation, while in a
social situation it can mean commotion. We might handle this by giving three
different definitions to the word Erregung, but then we risk missing the
common ground that exists between these three senses of the word. A knowledge
structure should be able to represent the fact that the three senses are
analogous: there is a disturbance of a state of rest by a phenomenon that is
unstable or unpredictable in some sense. Metaphoric language, then, seems to
differ from other sorts of language use in degree rather than in quality.

One possible difference between metaphor and normal language use is that
metaphor is unpredictable. When a new metaphor is encountered, we presumably
have to call in our analogy processor to find the points of similarity and the
points that are irrelevant. Once the senses of a word are known, however, it
is not clear that the closeness of the senses is as important anymore.
(There could be the usual benefits of a shared model - economy of storage,
uniformity of representation, etc. - but it is not clear that these issues are
relevant.) This contrast of metaphor and "regular" language ignores the point
that language usage has to be learned. When we encounter the word Erregung in
a technical sense for the first time, we are already prepared with a set of
possibilities that can be evaluated to determine which is relevant to the new
usage. To handle both metaphoric language and other more common sorts of
language use, then, a deductive component will need to be able to inspect and
alter word definitions (or some model of them), and will need the general
ability to reason by analogy.

A related issue here is that of dead metaphor. Some clichés and
idiomatic expressions have lost the freshness of their analogies, and one
might conclude that the original senses are no longer relevant. [...] I have
no idea of the original senses of these, although it would probably be
interesting to find out. The point here is that we can still use such phrases
even when their original comparisons are unknown or long forgotten. For these
sorts of dead metaphors, a system should probably hook the phrases directly to
their intended meanings, without worrying about analogies. Thus, the first
example would have roughly the same meaning as, "That is outlandish," and the
meaning of the second example would be about the same as, "He was astounded."
Tt Te not alueus. spay, however, to tel! wheo.e. metapher. Le.dead, end I 
_ think we should be epreful not. to, throw. sumy.tra,much> Canplder the.eityation 
of prepositions. For ‘the purposes of chepter 4. the hasig for eagociating « 


preposition with & verb or an adjective vas. Segated: on..ch! tears, Af one 


_, looks deep enough, however, there le often. a. compelling cageon.. for. the. choica. 


Consider the example "abhängen von." We would translate this as "depend on,"
but the literal meaning is "hang down from." Here dependence is formulated
in terms of a physical situation. If we hang X from Y and then, say, move Y,
this will have an effect on X. Note that the English "depend on" uses a
similar physical analogy, but it is a slightly different one, that of support.
If X is set on Y, then a motion in Y will affect X as well. (The Latin
ancestor of depend - dependere - shares the hang-down-from analogy, but that
is irrelevant here.) One could argue that the spatial analogies here represent
dead metaphors, and that hanging things down and piling things up have no real
connection to our thoughts about dependence. The more closely one studies
prepositional use in both German and English, however, the more spatial
analogies can be found. This situation seems to be especially interesting in
the light of Minsky's suggestion that a single mechanism could account for
both visual and conceptual processing (27). Prepositions seem to be one more
example of the intimate relationship between visual and conceptual processes.


5.8 Simulating the Deductive Component

The implementation uses an extremely simple routine to by-pass
understanding. It is described here for completeness only, since it will be
obvious that such a scheme would be impractical in a working system. The
program requires that the user understand the text and supply the information
that would ordinarily be supplied by a deductive component. By the time all
this interaction has taken place, the user could just as well have translated
the text by hand. Nevertheless, the by-pass routine does allow us
to get an idea of how the system would behave if it were more complete.

After the semantic component has produced the possible representations,
the user is asked to decide which representation he would like to send to the
generator (by typing in a number), or whether he would like to see the
generator try its hand at all the possible representations in turn.
For each representation that is to be generated, the user
is then asked to supply concept markers for the nodes with #UNSPEC. Finally,
nodes marked GIVEN and PARTICULAR are presented, and the user
is asked to specify other nodes that are coreferent. The other information
mentioned in section 5.6 is not requested, however, since the generator is
not fine-tuned enough to use it.


Chapter 6 -- Generation 


6.1 The Process of Surface Generation
6.1.1 Input to the Generator
Having completed syntactic and semantic analysis of an input sentence, we
are now at the point where generation of an English sentence can begin. The
first question is just what the surface generator should have as its input,
and in general the answer to this is not clear. It seems natural to work with
the semantic representation that has been constructed. In designing the
semantic representation, every effort was made to include as much information
as possible, with the hope that this would be sufficient for the generating
process. As will be discussed below, the semantic representation is in fact
not adequate for every eventuality, but it still constitutes the major input
to the generator.
One could question whether the semantic representation is the proper
input for generation. For example, when translating written German into
English, I find myself using syntactic guidance. One look at a prenominal
clause like "die unter der Haut liegenden Zellen" (literally "the under the
skin lying cells") and "relative clause" or "that" comes to mind. This use of
language-dependent translating rules or heuristics may be a personal
idiosyncrasy, or it may, in fact, be one of the shortcuts that people often
use when translating. At any rate, rules depending on the relation of source
to target language were not used in the translating system. There seemed to
be no case where this was necessary, since it was always possible to formulate
semantic rules corresponding to the syntactic ones.


6.1.2 Comparing Generators and Interpreters
The surface generator was written in an extended version of PROGRAMMAR,
and in many ways it is similar to the interpreter. Since the interpreter and
generator were built for two different natural languages, we would not expect
them to be identical on a line-for-line basis. But even if only a single
language were involved here, we would not expect the generator to be a
line-for-line inverse of the interpreter. This is because there are different
knowns and unknowns for the two processes. Both the interpreter and the
generator will have to make the same choices, since both will be using
linguistic information based on a systemic characterization of the languages
involved. But different information may be available at different times for
the two processes. An interpreter may have to delay several processing steps
until a local ambiguity can be resolved (or, alternatively, it may try one of
the possibilities and back up). A generator does not face this sort of
ambiguity at all, since all of the information it needs is available in
unambiguous form from the semantic representation. (Here, I am using local
ambiguity to mean ambiguity that will be eliminated by the time a parser has
finished syntactic analysis of a sentence [...].) In addition, while the
choices made for generating and interpreting may be comparable, the relative
importance of the choices will differ. The interpreter uses its knowledge of
grammatical redundancy and, in its better moments, semantic likelihood to
decide which choice was made by the author of an utterance. The generator, on
the other hand, has a characterization of a message to generate. It must make
choices to communicate its message effectively and, with luck, gracefully and
unambiguously. We can therefore expect the two processes to differ in
relative timing and emphasis.


6.1.3 Translation and General-Purpose Generation

Several times already I have referred to the generating component as a
"surface" generator, and I should perhaps clarify what I mean by this. In
particular, how does the generation process envisioned here compare to the
general process of writing text in one's own language? The big difference is
that the generator in the translating system is starting from a highly
specified semantic representation which is in most cases (I will discuss the
exceptions later on) unchanged by the generator. Obviously, a lot of "deep"
organization has already been done by the time such a semantic representation
can be produced. To write the original paragraph, the information had to be
assembled [...]. Patterns of reasoning and argument had to be chosen, and
decisions about the relative importance of different information had to be
made. These steps are not necessarily independent, but may be linked in
rather complicated ways. The surface generator in the translating system does
not, in most cases, have to consider these choices, since they are already
reflected in the semantic representation. In this sense, then, generation for
translation is easier than its more general-purpose counterpart.
I should note that in characterizing generation for translation as
general-purpose generation minus some steps, I do not wish to suggest
that the semantic representation used here necessarily corresponds to an
intermediate level in the general generation process. In fact, I suspect that
in a general-purpose generator, we would not want to create such a highly
organized semantic representation while ignoring the lexical and
grammatical level of the target language. It would not surprise me,
therefore, to see substantial differences between the organization of a
general-purpose generator and the generator described here.
In addition to these differences, we can say that if generation for
translation can be considered easier than general-purpose generation, in
another way it is harder. The generator does not have to decide what to say,
but the other half of the coin is that it has to tailor what is generated to
the intent of the original text. And this text, of course, was written in a
different language. If we can say that languages are organized to convey
meaning, we can also say that the organization is
influenced by the facilities available in a given language. Word choice is an
obvious example here - there is often not an exactly equivalent word in the
target language. But the problem actually manifests itself at all levels of
linguistic organization. When a mismatch between languages occurs, we often
have to compromise, suspending one goal to achieve another. It is true that
similar compromises must also be made in general-purpose generation,
especially in a situation where one is particularly concerned about style.
(The worst case here is translating poetry.) The need for such compromise is
much more frequent in translation, however, and occurs even in cases where
style is relatively unimportant.


6.2 The English Grammar
6.2.1 The Basic Shape of the Generating Grammar

The English generator has three main parts. There is a group of English
syntactic specialists for clauses, noun groups, preposition groups, and
adjective groups. In addition, a set of routines exists to build, maintain,
and inspect a "generation tree," which records the progress of the generating
process to date. Finally, there are definition programs associated with the
concept markers. Only two standard definition programs are used, but we shall
see that they offer a great deal of latitude in the form a definition may
take. The generator translates a single sentence at a time, although there is
nothing to stop it from using information about the context of a sentence, or
from inspecting the text that has already been generated.

Before going on, I should make a few remarks on the scope of the
generator. The system has only a few English dictionary definitions at this
point, but the routines for writing definitions are quite general. The
English syntactic specialists are not as extensive as their German
interpretive counterparts. The reason for this was basically one of time, and
extending the breadth of the generating routines would be a straightforward
process. This is clear first because the existing programs can generate
moderately complex sentences: subordinate clauses, conjoined structures, and
rankshifted noun groups. Second, the major gaps in the English generator,
most notably that it does not deal with questions or commands, can be plugged
with code that will look very much like the declarative code that has been
written. Declarative, interrogative, and imperative paths through the
generator should share a lot of common code, as they do in the German
interpretive grammar. In contrast to extending the breadth of the generator,
extending its depth, i.e. giving it the ability to make more informed
choices, is of course a more difficult task. The syntactic types that were
implemented, however, pose enough questions for a start, since in language
there are no truly "simple" examples. Finally, the routines for generation
tree construction are fully implemented, and they are the subject of the next
section.
6.2.2 Additions to PROGRAMMAR

The first step in building the English component was to make two additions
to PROGRAMMAR. A mechanism was necessary for building and maintaining a
generation tree, and special functions were needed to specify the nodes to be
added to the tree. The first task was a relatively simple one - generation
tree nodes are defined as carrying the following information:

FEATURES
    The syntactic features of the unit

ROOT
    If the node is a terminal one, the root of the word translated

PHRASE
    The phrase generated for the unit

SEMANTICS
    The semantic node for which the unit was generated

The information at a node may be read using the functions FE-E, PH-E, H-E, and
SM-E with the node as argument. Here, the "E" stands for English and is used
to maintain the distinction between this generation tree and the German parse
tree. Global variables analogous to those for the parse tree are maintained:
C-E, H-E, etc., with the obvious meanings. The message variable mechanism is
the same, and so, with some added code, are the backup functions POP and
[...].
A node is added to the generation tree much as in parsing, and the basic
function for this is TRANSL. When TRANSL is called a node is set up, and if
TRANSL succeeds, the node is added to the tree. Just as with PARSE, a call to
TRANSL may request a clause or group, or the call may request a
lexical unit like NOUN or ADJ. Unlike PARSE, the call to TRANSL also contains
the name of a node in the semantic representation for which the generation is
to be done. Also unlike the parse tree, the daughters of a node on the
generation tree do not have to be accumulated in order: it is possible to
specify that a node be attached anywhere in the list of daughters of the
currently active node. A call to TRANSL with a clause or group name causes a
call to the corresponding English generating specialist, and its success or
failure determines the success or failure of TRANSL. If TRANSL is called to
generate a single word, it executes a procedure associated with the concept
marker of the OSS, RSS, or PSS to be translated. These individual procedures
will be discussed more fully below.
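
The node layout and the behaviour of TRANSL described above can be sketched
in a modern language. The Python fragment below is only an illustration under
stated assumptions, not the PROGRAMMAR code: the names GenNode, attach, transl
and word are hypothetical, the builder passed to transl stands in for a
syntactic specialist or definition routine, and backup is not modelled.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass(eq=False)  # nodes are compared by identity, not by content
class GenNode:
    """One generation-tree node; field names follow the list in the text."""
    features: List[str]            # FEATURES: syntactic features of the unit
    semantics: str                 # SEMANTICS: name of the semantic node
    root: Optional[str] = None     # ROOT: filled in for terminal nodes only
    phrase: str = ""               # PHRASE: text generated for the unit
    daughters: List["GenNode"] = field(default_factory=list)

    def attach(self, child: "GenNode",
               before: Optional["GenNode"] = None) -> None:
        """Add a daughter at the end, or just before an existing daughter."""
        if before is None:
            self.daughters.append(child)
        else:
            self.daughters.insert(self.daughters.index(before), child)


def transl(parent: GenNode, features: List[str], smnode: str,
           build: Callable[[GenNode], bool],
           before: Optional[GenNode] = None) -> Optional[GenNode]:
    """Set up a node, run the builder, and keep the node only on success."""
    node = GenNode(features=list(features), semantics=smnode)
    if not build(node):
        return None                # failure: the tree is left unchanged
    parent.attach(node, before=before)
    return node


def word(root: str) -> Callable[[GenNode], bool]:
    """Hypothetical builder standing in for a dictionary definition routine."""
    def build(node: GenNode) -> bool:
        node.root = root
        node.phrase = root
        return True
    return build


clause = GenNode(features=["CLAUSE", "DECLARATIVE"], semantics="RSS1")
verb = transl(clause, ["VERB", "MVB"], "RSS1", word("has"))
subject = transl(clause, ["NG", "NOM"], "OSS1", word("cephalopod"),
                 before=verb)
failed = transl(clause, ["NG", "OBJ"], "OSS2", lambda node: False)
order = [daughter.phrase for daughter in clause.daughters]
```

The subject is generated after the verb but attached before it, which is the
out-of-order attachment the text describes; the failed call leaves the tree
with its two existing daughters.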
The generating routines invoked by TRANSL to translate groups look very
much like their interpretive counterparts. The basic statement type is still
the PROGRAMMAR branch function ":", although limited to a two-way branch,
since it is not clear how the three-way branch should be defined for
generation. The inspection functions, like CQ-E, correspond to
those in PARSE, as do F-E, FQ-E, and related functions that add information
to a node. At the outset, it is not completely clear that it is necessary to
construct a generation tree, but holding on to syntactic information like the
location of an adjective or object of a preposition allows the grammar to make
decisions based on the structure of the generated sentence at that point. A
tree also permits easy incorporation of backup, should the generation process
run into difficulty.


6.2.3 The Generating Process

Where the interpreter moved left to right over a sentence, the generator
moves basically top-down through the semantic representation. That is, the
generator moves top-down through the LINKAGE registers, and at each stage,
after participants have been translated, it sequences through the MODIFIERS
links. An example might be the easiest way to explain the transitions
involved. Figure 6.1 shows the semantic representation for:

    Der Cephalopod besitzt gelbe Chromatophoren. - The cephalopod has yellow
    chromatophores.

At the top level the generator calls (TRANSL CLAUSE-E MAJOR TOPLEVEL SMNODE:
RSS1), which in turn calls the syntactic specialist CLAUSE-E. Since these "-E"
tags are no doubt distracting, I will ignore them from here on. Any routine
mentioned in this chapter is part of the English component unless otherwise
noted. The main relation (here, RSS1) is always the topmost link in the
semantic representation, so it is specified in the clause call. The first
decision in CLAUSE is whether to generate an interrogative, imperative, or a
declarative clause, and the CLAUSE-TYPE register of the main relation is
checked for its recommendations. Since here the clause type is "statement,"
the generator follows the declarative path, placing the tag DECLARATIVE in the
current node C. The tag TOPLEVEL indicates that the node generated will not
have a parent on the tree. Next, CLAUSE calls the phrase routine VERB-PHRASE,
which makes the call (TRANSL VERB MVB SMNODE: RSS1). Note that in
English, as in German, we will follow Hudson and speak of a "verb phrase"
instead of a verb group. The generator translates the main verb first, since
it is the key to the ordering of the other constituents in the clause. A call
of TRANSL with MVB will return the verb node and also returns a participants
list with the grammatical structures that the verb expects for its
participants. The list has been ordered, taking into account the objects
required by the verb chosen, the theme of the sentence, and whether the verb
is grammatically passive or active. VERB-PHRASE should also go on to generate
any auxiliary verbs and supervise tenses, but only the simplest active and
passive constructions are handled right now.
The participants list returned by the main verb is in the form of a
series of calls, and the clause routine need only execute it. CLAUSE does so
now and calls:

    (TRANSL NG FULL NOM BEFORE <verb node> SMNODE: OSS1)

The BEFORE here indicates that this noun group, since it will be the subject,
should precede the verb node just generated. The feature NOM indicates
nominative; other English syntactic cases used are OBJective and POSSessive.
FULL, as in chapter 2, means that a rankshifted noun group may be generated if
necessary. In translating the noun group, NG first consults the COREF
register to see whether a pronoun is in order. Word choice
will often depend on the word choice made for its coreferent. From here NG
calls (TRANSL NOUN SMNODE: OSS1). It is clear that it makes sense to
translate the noun in a noun group first, since this choice will tend to
influence word choice in the modifiers. This is most obviously true for
compounds, which may absorb some of the modifiers. For any sort of noun,
since we do not have a one-to-one mapping between words and concepts, the
generator always has to be alert to a mismatch between noun and
modifiers. It seems most natural to express these requirements in terms of
the noun to be modified. At any rate, once the noun has been translated, NG
goes on to translate modifiers into the appropriate pre- and post-nominal
structures, using rules based on semantic case to do the ordering. Our
example "der Cephalopod" has only a determiner, which is generated from
relations with cases SELECTION and NUMBER, taking the category given-new into
consideration. Since no relations remain, TRANSL can collect its results,
"the cephalopod," in the noun group node under PHRASE. Both the subject and
the verb have been generated, so TRANSL can call the routine AGREE. AGREE
makes morphological changes to the verb root so that it will agree with the
subject in person and number. Note that in English, agreement is much less
complex than in German and that AGREE has a much easier time than its
interpretive counterpart INPUT, since it does not have to go searching for an
unknown root.


CLAUSE is now ready to generate the next entry in the verb node's
shopping list, and it turns out that we want the direct object. The example
finishes up as CLAUSE makes a second call to NG with (TRANSL NG FULL OBJ
SMNODE: OSS2). NG will generate the main noun from OSS2, then take care of
RSS4 to generate the adjective yellow. When the direct object NG call has
returned, the generator gathers up the results from the PHRASE entries of the
constituents, makes a PHRASE entry in the clause node, decides on punctuation,
and returns from CLAUSE.
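
The order of events in this example (main verb first, then the participants
list, subject placed before the verb, agreement, direct object, and finally
collection of the phrases) can be sketched as follows. This Python fragment
is a toy under stated assumptions and not the generator itself: the lexicon,
the dictionary layout of the semantic nodes, and the function names
noun_group, agree and generate_clause are all hypothetical, only third-person
agreement is handled, and the participants list is hard-wired rather than
returned by a definition routine.

```python
# Hypothetical toy lexicon: concept marker -> English root.
LEXICON = {
    "#HAVE": "have",
    "#CEPHALOPOD": "cephalopod",
    "#CHROMATOPHORE": "chromatophore",
    "#YELLOW": "yellow",
}

# Hypothetical stand-in for the semantic representation of the example.
SEMANTICS = {
    "RSS1": {"marker": "#HAVE", "args": ["OSS1", "OSS2"]},
    "OSS1": {"marker": "#CEPHALOPOD", "number": "singular",
             "det": "the", "mods": []},
    "OSS2": {"marker": "#CHROMATOPHORE", "number": "plural",
             "det": None, "mods": ["#YELLOW"]},
}


def noun_group(smnode: str) -> str:
    """Translate the noun first, then determiner and modifiers around it."""
    node = SEMANTICS[smnode]
    noun = LEXICON[node["marker"]]
    if node["number"] == "plural":
        noun += "s"
    words = [node["det"]] if node["det"] else []
    words += [LEXICON[marker] for marker in node["mods"]]
    words.append(noun)
    return " ".join(words)


def agree(verb_root: str, number: str) -> str:
    """Third-person agreement only; 'have' is the one irregular form here."""
    if number == "singular":
        return "has" if verb_root == "have" else verb_root + "s"
    return verb_root


def generate_clause(smnode: str) -> str:
    relation = SEMANTICS[smnode]
    verb_root = LEXICON[relation["marker"]]      # main verb comes first
    # Participants list that would accompany the verb: case, position, node.
    participants = [("NOM", "before", relation["args"][0]),
                    ("OBJ", "after", relation["args"][1])]
    constituents = [verb_root]
    verb_index = 0
    for case, position, arg in participants:
        phrase = noun_group(arg)
        if position == "before":
            constituents.insert(verb_index, phrase)
            verb_index += 1
        else:
            constituents.append(phrase)
        if case == "NOM":                        # subject known: fix the verb
            constituents[verb_index] = agree(verb_root,
                                             SEMANTICS[arg]["number"])
    sentence = " ".join(constituents)
    return sentence[0].upper() + sentence[1:] + "."


result = generate_clause("RSS1")
```

With the toy data above, result is "The cephalopod has yellow
chromatophores."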


This was obviously a relatively simple example. The English
grammar can handle more complex ones. Before further discussion, however, it
might be a good idea to take a look at the mechanisms for word choice.
6.3 Translation at the Lexical Level

6.3.1 The English Definition Routines

When TRANSL is called to construct a terminal node in the generation
tree, it finds the concept marker of the semantic node under
consideration and then retrieves an English definition routine for that
concept. The definition routines are roughly analogous to the semantic
specialist routines in the interpreter. Where the semantic specialists
inspect the parse tree in order to build a semantic representation, the
English definition routines start from the semantic representation and use
special information to not only supply a node for the generation tree (and
hence a translation) but also to specify other nodes that will be required if
this translation is to be used. The syntactic shopping list supplied by
definition routines when a relation is translated is analogous to the global
variables bound by the interpretive semantic specialists to find arguments
to relations in the semantic representation.
There are two standard dictionary routines, TRANSL-REL, for translating
semantic relations, and TRANSL-OBJ-PROP, for translating objects and
properties. The two routines are quite similar, except that more information
must be supplied to translate relations. It would probably be helpful at this
point to list the information that can appear in a definition. Starred items
appear only in definitions for relations, while the rest may be used for all
three classes.
ORDER*: LEXACT or LEXPASS

TYPE*: NONE, ONE, TWO, etc. This specifies the entries in the
relation's linkage register that may be left understood.

ROOT: the root of the translation

FEATURES: the part of speech of the translation and then a list of
other features, which are not now required to match those in the TRANSL
call

WORD: specified for irregular forms only

PARTICIPANTS: a variable or a procedure that specifies the
participants list

CONNOTATIONS: currently only one feature and optional

PROBABILITY: a number from 1 to 18

CONDITION: a procedure that must evaluate to non-nil if this
definition is to be used

COLLOCATIONS: a list of parts of speech and roots of words required.
This is currently used only for prepositions and particles.

PROCEDURE: a procedure that is executed if this definition is selected.
It allows a concept to be translated by more than one word and does other
useful things.
A call to a definition routine would normally be passed a list of possible
definitions, each expressed in terms of these keywords and their values. The
possible definitions first go through a preliminary process of elimination,
based on the part of speech required in the TRANSL call (a match against
FEATURES), a match on connotations (if CONNOTATIONS is set in the definition,
then it must match the connotations in the semantic node), and an evaluation
of the CONDITION procedure, if there is one. In a relation, definitions
are also screened for ORDER (LEXPASS or LEXACT) and TYPE (NONE, ONE, TWO,
etc.) agreement, which were discussed earlier. After the preliminary
screening, a definition with the highest PROBABILITY is picked, or if there
are several definitions with this probability, the first one encountered is
used. In the English dictionary, the probabilities are currently numbers
from 1 to 18, and reflect a rough estimate of the order in which definitions
should be tried. Probabilities could, however, be set by
procedures that check context. Once a definition has been chosen,
its features are added to the feature list of the node. The other definition
lists that passed the preliminary screening are placed in the LEXALTLIST
register (lexical alternative list) on the node in case backup is necessary
later on. If we are translating a relation, its participants shopping list is
also attached to the node. Finally the root of the English word is returned
to TRANSL, which finishes building the node, calling morphology routines to
add the necessary endings.
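
The screening and selection steps just described can be sketched in a few
lines. The Python fragment below is an illustration under stated assumptions,
not the dictionary code: definitions are represented as plain dictionaries
keyed by the keywords listed above, the function name choose_definition and
the sample entries are hypothetical, and the ORDER and TYPE screening for
relations is left out.

```python
from typing import Any, Dict, List, Optional, Tuple

Definition = Dict[str, Any]


def choose_definition(definitions: List[Definition], part_of_speech: str,
                      connotation: Optional[str] = None
                      ) -> Tuple[Optional[Definition], List[Definition]]:
    """Return (chosen definition, alternatives kept for backup)."""
    survivors = []
    for definition in definitions:
        # Part of speech is the first entry in FEATURES.
        if definition["FEATURES"][0] != part_of_speech:
            continue
        # CONNOTATIONS is optional; if set, it must match the semantic node.
        if ("CONNOTATIONS" in definition
                and definition["CONNOTATIONS"] != connotation):
            continue
        # CONDITION is optional; if present it must succeed.
        condition = definition.get("CONDITION")
        if condition is not None and not condition():
            continue
        survivors.append(definition)
    if not survivors:
        return None, []
    # max() returns the first maximal item, i.e. "the first one encountered".
    best = max(survivors, key=lambda d: d["PROBABILITY"])
    lexaltlist = [d for d in survivors if d is not best]
    return best, lexaltlist


# Hypothetical sample entries for one concept marker.
definitions = [
    {"ROOT": "glimpse", "FEATURES": ["VERB"], "PROBABILITY": 4,
     "CONNOTATIONS": "BRIEF"},
    {"ROOT": "see", "FEATURES": ["VERB"], "PROBABILITY": 8},
    {"ROOT": "notice", "FEATURES": ["VERB"], "PROBABILITY": 8},
    {"ROOT": "sight", "FEATURES": ["NOUN"], "PROBABILITY": 9},
    {"ROOT": "spot", "FEATURES": ["VERB"], "PROBABILITY": 9,
     "CONDITION": lambda: False},
]
best, alternatives = choose_definition(definitions, "VERB")
```

Here "see" is chosen over "notice" only because it is encountered first, and
"notice" is what would be held in LEXALTLIST for backup.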


6.3.2 One Marker, Several Words
The last section presented the simplest case, where a concept marker is
translated by a single word. The definition routines must also be able to
handle the case in which one marker becomes several words, and the case where
several interrelated markers become one word. For situation one, several
mechanisms are available. If the marker is a relation, the COLLOCATIONS list
is used for prepositions and particles required by verbs, adjectives, etc.
The definition procedures access this list, and insert the required word in
the correct place in the participants shopping list. For more complicated
constructions, PROCEDURE is used. Assume, for the sake of example, that the
conceptual structure contained the marker #BEFRIEND, and we wanted to
translate this into English as "make friends with." One way to do this in the
system is with a definition of the form


    [definition listing not recoverable from the source; the surviving
    fragment ends "... friend)"]


Here, LEXACT means that the order of the arguments of the verb corresponds to
the order of the arguments of the concept marker. NONE means that no marker
arguments have been left understood, and make is the root of the main verb.
Where a phrase is produced by a definition, the word in the ROOT position
corresponds to the part of speech TRANSL would be looking for; in this case,
the call would have been (TRANSL VERB MVB). Here, PARTICIPANTS is set to a
variable whose value in the system is a series of calls. The PARTICIPANTS
would be:
    (SETQ PREPOBJ-INTRANSA '((TRANSL NG ...
                                     SMNODE: (CAR ARGSLIST) ...
                                     BEFORE: ...)
                             (... ROOT: %REQUIRED-PREP% ...
                                  (NG SIMPLE OBJ))))
ARGSLIST is an ordered list of arguments of the RSS to be translated. The
definition procedure will set up a register so that "with" is substituted
for %REQUIRED-PREP% when this preposition is generated. Note that here and in
PROCEDURE, if a ROOT is supplied in a TRANSL call along with the part of
speech, the definition routines are by-passed, and the node uses the
FEATURES, ROOT and, if given, the WORD supplied in the call. Similarly, if a
group of calls is to be executed, then the keyword GROUP can be used,
followed by a list of TRANSL calls and other functions. The GROUP feature
makes it possible to by-pass syntactic specialists. Note that we do not have
to fully specify the words in a group when using these features, for example
the preposition group from the participants list above:


    ((TRANSL PREP ...
             WORD: %REQUIRED-PREP%)
     (TRANSL NG SIMPLE OBJ
             SMNODE: <semantic node of second argument of #BEFRIEND>))


This call says, in effect, that a preposition is to be generated from the
required word and followed by a simple noun group in the objective case. This
ability to by-pass definition routines and syntactic specialists
is extremely useful, and allows us to write efficient definitions in a
relatively economical manner. The one drawback might be the number of
features that must be specified along with the words, but it would be
relatively easy to incorporate a prompting aid in definition
writing, and to package the most common types of items in the same way
that the participants lists have been. Fifteen or so standard
patterns would probably cover most of the different multi-word
situations that come up.


6.3.3 Several Markers, One Word
The last section involved one marker translating to several
words. Let us now take a look at the case where several interrelated concept
markers are translated into a single word. This is [...]
sentence, and then PROCEDURE is used to tell the rest of the generator how
much of the semantic representation has been absorbed by the particular word.
An example of a multi-marker to single word translation is the generation of
the adjective visible from the representation shown in Figure 6.2. Here, the
star indicates that any semantic node may be in this position. Among other
information, the translation routine would contain:

    ROOT: visible

    CONDITION:
      (MATCH (FIND-LINK: #PERSON)
             ...
             (LINKS: #PERSON ...))

    PROCEDURE:
      (REMOVE #ABLE #GENERAL MODIFIERS)
When CONDITION is executed, the routine MATCH starts at the place in the
semantic representation currently being considered by the generator, in this
case the RSS for #SEE. The MATCH routine moves through the MODIFIERS and
LINKAGE paths specified, comparing concept markers against the arguments
given. If a concept marker alone is not enough to distinguish a modifier, the
case may be supplied, as was done for the HAVE-PROPERTY relation above.
LINKS, MODIFIER and FIND-LINK always refer to the arguments of the relation
examined just previously. To change the focus of attention, MATCH may be
given a semantic node, as was done above. MATCH is currently limited in the
kinds of comparisons that can be made; for example, it is not now possible to
specify that two uses of the same concept marker should refer to different
variables. Such extensions, however, would be straightforward. The matching
process is not a particularly expensive one, since the design of the
representation is always fixed, and we know which nodes are to be heads and
which modifiers. Note that MATCH is not the only sort of routine that may be
used as a condition. CONDITION succeeds or fails depending on whether its
associated procedure succeeds or fails.

Once the condition has been satisfied and the definition is chosen,
PROCEDURE is executed to tell the generator not to try to translate the
modifier #ABLE (where the case here is given as #GENERAL), since this has
already been used. The only point worth mentioning about PROCEDURE is that
its effects must be limited to the node currently being constructed by the
TRANSL call. This is necessary because when a node is popped from the tree,
we want all the constraints it has placed on the generation process to
disappear with it. The limitations on PROCEDURE are currently self-imposed by
the dictionary writer, since the generator makes no checks on what is being
set.
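
The division of labor between CONDITION and PROCEDURE can be illustrated with
a minimal sketch, written in Python rather than in the list notation of the
dictionary. Everything in it is an assumption made for illustration: the
dictionary-based node layout, the helper names match_modifier and
remove_modifier, the fallback word "seen", and the "absorbed" register are
hypothetical and are not taken from the implementation.

```python
# Hypothetical node layout: a concept marker, its (marker, case) modifiers,
# and a register of modifiers already expressed by the chosen word.
def make_node(marker, modifiers=()):
    return {"marker": marker, "modifiers": list(modifiers), "absorbed": set()}


def match_modifier(node, marker, case=None):
    """CONDITION helper: succeed if the node carries the given modifier."""
    for found_marker, found_case in node["modifiers"]:
        if found_marker == marker and (case is None or found_case == case):
            return True
    return False


def remove_modifier(node, marker, case):
    """PROCEDURE helper: note that the chosen word already expresses this modifier.

    Only the node passed in is touched, so discarding the node also
    discards the constraint.
    """
    node["absorbed"].add((marker, case))


# Each definition is (word, condition, procedure); the entries are invented.
DEFINITIONS = {
    "#SEE": [
        ("visible",
         lambda n: match_modifier(n, "#ABLE", "#GENERAL"),
         lambda n: remove_modifier(n, "#ABLE", "#GENERAL")),
        ("seen", lambda n: True, lambda n: None),
    ]
}


def translate(node):
    """Return the first word whose condition succeeds, running its procedure."""
    for word, condition, procedure in DEFINITIONS[node["marker"]]:
        if condition(node):
            procedure(node)
            return word
    return None
```

In this sketch a #SEE node modified by (#ABLE, #GENERAL) yields "visible" and
records the modifier as absorbed, while a bare #SEE node falls through to the
second definition and absorbs nothing.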


The two definition methods discussed here give the dictionary routines a
great deal of power. The method described in the last section allows the
generator to translate a single marker into a classifier plus noun, or to a
whole clause if it is desired; the method in this section allows the
generator to handle relation participant nouns such as indication, which are
defined as #SOMETHING plus a relation, in this case "something that
indicates." We can combine the two methods to translate a piece of semantic
representation into more than one English word, which should be useful for
idiomatic phrases like "Strike while the iron is hot." Given this generality,
the writing of dictionary routines is relatively simple, and it will become
easier when more standard patterns and procedures are built into the
generator.


6.4 Stumbling Blocks in the Generating Process
In the example used to explain the generating process, the generator was
suspiciously successful at each translating attempt. This is not always the
case, and several problem situations will be discussed here. The first is the
need for backup, then the issue of repetition, and the problem of sentence
length.


6.4.1 Backup

There are several cases in which backup is necessary. The most obvious
is the case of lexical gaps, or awkwardness, in the target language. In
general, if the first translation attempted by the system does not succeed,
the generator needs a way to try the alternatives in an orderly manner. The
mechanism supplied to handle backup in the generator is a fairly general one,
although I have not written enough backup code at this point to give more than
an initial report on its performance. The reader should keep in mind that I
am discussing a facility provided to allow the designer to incorporate backup
into programs, not a full-fledged backup mechanism.

There are two levels in the generator where choices are made: at the
lexical level and within the syntactic specialists. There are therefore two
sets of lists that are maintained as registers on nodes for backup purposes:
LEXALTLIST (the lexical alternative list) and ALTLIST (the structural
alternative list). When a definition is evaluated and a word is chosen, other
possible definition lists that passed the initial screening mentioned in
section 6.3.1 are placed in the LEXALTLIST of the node being created. At the
structural level, every choice that might be independently reversible (i.e.
would not imply automatic reversal of choices made earlier) leaves tracks on
the ALTLIST. These tracks consist of the function in which the choice
occurred and a statement label where code for the alternative choices begins.

If the generator runs into difficulty (i.e. all possible TRANSL calls at
a given point fail), special backup routines are called. In general, the
first step taken will be to try to retranslate the relation for which the
problematic semantic node is a participant, if such a relation exists.
LEXALTLIST's are used to supply these new alternatives. Backup routines do
what they can to keep other participants that have been generated intact, that
is, to find lexical alternatives with similar participant stopping lists. If
no lexical changes solve the problem, a structural change is attempted. For
this, ALTLIST is used, and an attempt is made to reverse the most recent
choice first. The offending nodes are popped from the tree, and control
returns to the routine and label specified by ALTLIST. The syntactic
specialists are structured in such a way that business will proceed as usual
after a backup, except that the ALTLIST has been reduced by one possibility.
For the limited number of cases attempted, this backup scheme seems to be
general enough. It does rest on the assumption that structural choices can be
fully ordered with respect to each other, and that this ordering will not vary
from one situation to another. Only more experience with the generator will
determine whether this assumption allows enough generality in the system.
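
The ordering of the two kinds of backup can be shown in a small Python sketch.
It is only an illustration under stated assumptions: the Node class, the
choose_word and back_up functions, and the example function names and labels
are hypothetical, and the sketch saves every untried candidate rather than
only those that pass the screening of section 6.3.1.

```python
class Node:
    """A generation-tree node with the two backup registers."""

    def __init__(self, name):
        self.name = name
        self.lexaltlist = []  # untried lexical candidates, in dictionary order
        self.altlist = []     # (function, label) tracks, oldest choice first


def choose_word(node, candidates, acceptable):
    """Pick the first acceptable candidate and save the rest for backup."""
    for i, word in enumerate(candidates):
        if acceptable(word):
            node.lexaltlist = list(candidates[i + 1:])
            return word
    node.lexaltlist = []
    return None


def back_up(node, acceptable):
    """Try lexical alternatives first, then the most recent structural choice.

    Returns ("lexical", word), ("structural", (function, label)),
    or ("fail", None) when both registers are exhausted.
    """
    while node.lexaltlist:
        word = node.lexaltlist.pop(0)
        if acceptable(word):
            return ("lexical", word)
    if node.altlist:
        # The most recent choice is reversed first; the list shrinks by one.
        return ("structural", node.altlist.pop())
    return ("fail", None)
```

A caller would resume generation at the returned function and label after a
structural backup; that control transfer is outside the scope of this sketch.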

Before going on, I would like to consider the use of backup in the
context of generation. It may be that further research will point out ways to
cut down on backup by deferring some decisions to later in the process or
anticipating others. I do think, however, that backup will remain an
important facility in a generator. Unlike interpretation, the possible
surface structures for a given semantic representation are not fixed. There
can be no question here of generating "all" the possibilities instead of
backing up, since with a facility for paraphrase, we could no doubt go on
generating possibilities to rather ridiculous extremes. It might be desirable
to generate, say, the three most likely lexical possibilities at any given
point. This probably would not cause combinatorial explosion, since possible
paths would be eliminated quickly. I do not have enough experience with the
generator to judge [illegible]. Whatever new mechanisms are introduced, a
backup mechanism will no doubt remain an important component in a text
generator.


6.4.2 Repetitions
The phenomenon of repetition is generally a stylistic consideration; that
is, meaning is not affected seriously. Repetitions can, however, be extremely
distracting. For example, the first sentence of the sample paragraph contains
the phrase "ein deutlich sichtbares Zeichen." The word list for sichtbar here
contains visible, perceptible and evident; for deutlich the list is
distinct, clear, plain and evident. Now, as it happens this is translated as
"a clearly visible indication," but what if for some reason the generator had
chosen evident for sichtbar? Assume further that the dictionary only
contained one translation for deutlich and that one was evident. An
"evidently evident indication" sounds a little like Gilbert and Sullivan, and
the ADJG routine had better retranslate one of them to try to come up with
something a little less repetitive.

Not all repetitions are equally noticeable. For example, two that's in a
sentence would not be noticed at all, while three however's would not be
overlooked. In addition, some repetition of clause structure is desirable,
e.g. where [illegible]. Differences in function also tend to influence
judgements about repetitiveness.

(i) He went to the zoo to talk to the elephant.

(ii) The problem with meeting with the [illegible]
is that Harry won't come.

Here, the two with's in (ii) seem more prominent than the three to's in (i).
This seems to be because the second to heads an adjunct clause rather than a
preposition group. The question of which repetitions are repetitious, then,
is not as straightforward as it would seem.

In considering repetitions, let us first make the distinction between
structural and lexical ones. No efforts were made in the system to handle
structural problems like repetitious clause types, repetitious use of
conjoined structure, and other relatively high level phenomena. Heuristics
to handle these sorts of repetitions would be necessary for a complete system,
but I gave this low priority here. This decision is based on the assumption
that enough of the variation in the original text would come through
[remainder of sentence illegible in source; it concerns structural
repetitions].
For lexical repetitions, another useful distinction is between function
words and the rest. Function words are prepositions, binders, and
the like. Since they form a closed set, it would be preferable to handle
them by anticipation, rather than backup. Let us consider the system's
behavior for prepositional repetitions, the only check of this sort currently
implemented. When a preposition is to be generated by the system, the
preposition group specialist first looks back through what has been generated,
as far as [illegible] the major clause, whichever comes first. If other
preposition groups are found, a list of the prepositions is made, with those
that will be nearest to our new preposition group appearing first. The call
to TRANSL also takes an argument that we have not yet discussed:


    (TRANSL [illegible] NOT (prep1 prep2 ...) [illegible])

The NOT directs the dictionary routine to try to avoid these as choices.
If there is no other choice available, the situation is not currently fatal,
but the definition routine will try, if possible, to choose a preposition from
the end of the list, rather than choosing a preposition that will be near
the new one in the sentence. The NOT feature is currently implemented only
for words, but with some revisions of the syntactic specialists it could also
be used to avoid structural repetitions like those mentioned earlier.
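
The preposition check can be sketched as follows, in Python and under
assumptions of my own: word positions are represented as integers, and the
function names recent_prepositions and choose_preposition are hypothetical
stand-ins for the look-back step and for the dictionary routine's handling of
NOT.

```python
def recent_prepositions(generated, new_position):
    """List prepositions already generated, nearest to the new slot first.

    `generated` holds (position, preposition) pairs for the part of the
    sentence that the look-back covers.
    """
    ordered = sorted(generated, key=lambda item: abs(new_position - item[0]))
    return [prep for _, prep in ordered]


def choose_preposition(candidates, avoid):
    """Prefer a candidate not in `avoid`; failing that, the least harmful one.

    `avoid` is ordered nearest-first, so the candidate that occurs latest
    in it is the one farthest from the new preposition group.
    """
    if not candidates:
        return None
    for prep in candidates:
        if prep not in avoid:
            return prep
    return max(candidates, key=avoid.index)
```

With "with" at position 2 and "of" at position 7, a new group at position 9
gets the avoid list ["of", "with"]; if the only candidates are those two, the
sketch settles for "with", the more distant repetition.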

For content words, let us make one further distinction, between names of
coreferent noun groups and other words. The generator does not now worry
about repetitions among noun groups, [illegible], although I think
[illegible] is involved here. The system currently has [illegible] options
for generating coreferential noun groups. It can either repeat the preceding
noun group with the definite determiner, or it can use [illegible] pronoun.
That is, the system can refer back to "a big grey octopus" as either:
(i) the big grey octopus
(ii) [illegible]

No checks are currently made about other objects in the paragraph that might
have similar descriptions. A sophisticated program would presumably be able
to weigh carefully considerations about using reference [illegible]: distance
from coreferent noun groups, possibilities of confusion, in addition to
repetitiveness. Checks on repetition among coreferent noun groups,
therefore, would have to be integrated into the noun group specialist
carefully.


All the rest of the content words, like our "evidently evident" example,
are handled using backup. The heuristic currently used is that if the root of
an adjective, noun, or verb is repeated anywhere within an argument of the
main verb or a modifier of the main clause, then an attempt should be made to
correct the situation. This involves a straightforward search of LEXALTLIST
and then an attempt to splice in the node or nodes selected. If this does not
work, the repetition is currently left as is, but it would be possible to
initiate [illegible].
The heuristics used here are obviously very simple and limited ones, and
a good deal of refinement will be necessary. Probably the easiest way to do
this would be to build in as many basic heuristics as possible, then run the
system on some text. This would give the designer a chance to locate the
repetitions that actually do sound repetitious. The heart of the problem here
seems to me to be the analysis itself, and once good heuristics are found,
their implementation should be straightforward.
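
The root-repetition heuristic for content words can be sketched in Python as
below. This is an illustration only: crude_root is a deliberately naive
stand-in for real morphological analysis, the function names are
hypothetical, and the caller is assumed to pass in content words alone.

```python
def crude_root(word):
    """Very rough stand-in for morphological analysis (an assumption)."""
    word = word.lower()
    for suffix in ("ly", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def repeated_roots(words):
    """Return the roots that occur more than once, in order of discovery."""
    seen, repeats = set(), []
    for word in words:
        root = crude_root(word)
        if root in seen and root not in repeats:
            repeats.append(root)
        seen.add(root)
    return repeats


def fix_repetition(words, index, alternatives):
    """Swap words[index] for the first alternative that removes all repeats.

    If no alternative helps, the repetition is left as is.
    """
    for alternative in alternatives:
        trial = words[:index] + [alternative] + words[index + 1:]
        if not repeated_roots(trial):
            return trial
    return words
```

Here `alternatives` plays the part of the lexical alternative list for the
offending node; splicing the replacement into a generation tree is not
modelled.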


6.4.3 Those Long German Sentences
Sentence length is a problem that seems trivial until one starts to think
about it. Bound up in the question "How long should a sentence be?" are
issues of the sentence's role as a carrier of messages. To change sentence
length we have to know which information can be safely separated out and which
is essential to the integrity of the message. Also involved are
questions about human memory capacity. A definition of sentential complexity
must take into account human short term memory and processing limitations.
Something that is complex to one sort of system might be handled with ease in
a slightly different one. Thus, when we try to determine what size a sentence
should be, we have left surface generation and are really talking about deep
generation.

My system, of course, is based on the assumption that surface generation
in the target language will be sufficient. I have assumed that in general we
do not need to completely re-build the message in the target language, but
rather can generate from the comparatively high level of a semantic
representation. In most cases, this seems to be the most efficient way to go,
and, since we want to follow the organization of the original as closely as
possible, it seems to be the most effective as well. In dealing with long
sentences, I have continued to avoid true deep generation, and I think only
further research can determine whether this is a virtue or a vice. Let me
discuss long sentences in more detail, and then I will come back to this issue
below.

First, I should say that although academic German enjoys a certain notoriety
for its long sentences, the sentence length problem is not unique to the
languages chosen here. That is, if we were translating French to English,
instead of German to English, sentence length would still have been an issue.
Certain things are always easier to express in one language than in another:
it may take a clause to express in one what a word can express in the other.
Because of this, it is possible to start from an uncomplicated sentence in any
source language and end up with either a very large and complex translation or
a succession of very short sentences that should be combined.

In the system, I have concentrated on dealing with sentences that are too
long. This was strictly the result of time limitation, since short sentences
can be just as stylistically offensive as long ones (although possibly not as
hard to comprehend). I should also add that the heuristics in the system are
very fragmentary and are offered more to introduce the issues than to attempt
any sort of comprehensive answer to the problem.

The first question that must be asked is "How long is too long?", which
should be immediately reformulated as "How complex is too complex?". It is a
common observation that iterative structures are easier for people to
understand than recursive ones. Ludwig Reiners (28) warns against the dangers
of "Klemmkonstruktionen," or successive embedding of clauses. So length alone
is not the problem here, although often, of course, excessively complex
sentences are also excessively long. Length is one factor in complexity, but
not the only one. The recursive structures mentioned, in fact, exemplify one
very important contributor to complexity. Whenever a number of things are
started but not finished, we can expect the complexity level to be high.
Similar, but not identical, is the complexity introduced by back references.
If we have, say, three back references in the same sentence, we are not only
fighting potential ambiguities, but also the overhead of simply maintaining
the links between coreferent items. The complexity introduced by an
unfamiliar word or by complicated semantic content must also be reckoned with.

Sentential complexity seems to be an additive phenomenon. Some sentences
that I consider too complicated in English seem to be difficult not because
they contain a single complicated structure, but rather as a result of a
compounding of complexity. To see what I mean by this, consider our prize
example sentence and a translation that parallels the sentential organization
of the German:

(i) Ein deutlich sichtbares Zeichen für die im Nervensystem verlaufenden
Erregungen ist das Spiel der Chromatophoren der Cephalopoden, jener unter
der Haut liegenden gelb, braun, schwarz, violett oder karminrot gefärbten
Zellen, die sich entweder [illegible] oder durch radiär ansetzenden
Muskeln [illegible] ausgebreitet werden können.

(i) A clearly visible indication of the excitations that run through the
nervous system is the play of the chromatophores of the cephalopod, those
yellow, brown, black, purple or carmine colored cells that lie under the
skin, and which either contract or can be spread out under the skin
surface by radially affixed muscles.


This translation would sound better if two English sentences were used, but
what about the following?

(ii) Ein interessantes Beispiel [illegible] ist das Spiel der Chromatophoren,
jener unter der Haut liegenden gelb, braun, schwarz, violett oder
karminrot gefärbten Zellen, die sich entweder [illegible] oder durch
radiär ansetzenden Muskeln [illegible] ausgebreitet werden können.

(ii) An interesting example of this is the play of the chromatophores,
those yellow, brown, black, purple or carmine colored cells that lie
under the skin, which either contract or can be spread out under the
skin surface by radially affixed muscles.

To me, the English in (ii) is fine as a single sentence. Furthermore, I am
not overly disturbed by a sentence like:

(iii) A clearly visible indication of the excitations that run through
the nervous system is the play of [illegible]
those yellow, brown, black, [illegible]
skin.

Therefore, it seems to me that one structure (say, a [illegible] clause) can
tip the balance in an otherwise acceptable sentence, but we can not pin the
blame on the clause alone: it is the whole sentence (at least) that
contributes to judgements about complexity. A clause cannot be considered
complex except with respect to its context.

It is not always easy, then, to decide which sentences should get broken
down into smaller ones. In this respect, I think people have an easier time
of it than machines, since they have a very direct method of determining
complexity: if the mind boggles, it's too complex. Computers, whose minds
are not structured as ours are, don't boggle in the same way. But even if
we lack a model of human memory and processing, I think it is possible to
derive a measure for sentential complexity. By analyzing sentences judged by
people to be too complex, it should be possible to come up with a formula
that indicates which sentences are acceptable and which are not.

A look at the specific cases handled in the system would be
helpful, but before the "what," I must again consider the question "when."
When should the translating system make these checks on sentence, or more
properly message, length? There seem to be several options. First, there is
the deep generation approach. If we start our generation process from a set
of assertions from the data base of the deductive component, we can include
limitations on how large or deep a semantic representation should be, or what
patterns of linkage can occur. Alternatively, we can retain a surface
generation scheme but do complexity analysis on the semantic representation.
We would be using the same sort of information (although probably not
identical programs) as for the deep generation approach, but we would be
analyzing a structure after it has been built, not in the process of building
it. The third approach is for the surface generator to monitor itself. If
the output has become too complex, then suitable action can be taken to split
the semantic representation, possibly rebuilding some part of it in the
process.

I chose the third alternative for the system, primarily because
complexity of expression is language-dependent, and semantic
representation is a gesture toward language independence. By simple
inspection of the semantic representation, we cannot tell whether a relation
can be expressed by a word, or whether a clause will be necessary. In the
present system, it is easy enough, of course, to determine whether single
words do exist for a given concept. The point is, however, that we do not
know whether a particular word can be used in the translation until we take
into account both the interrelationships between parts of the message and the
linguistic constraints on structuring in the sentence.


[Beginning of paragraph illegible in source.] They suggest, however, the
direction in which one might proceed. The same example, again, was:

(i) A clearly visible indication of the excitations that run through the
nervous system is the play of the chromatophores of the cephalopod, those
yellow, brown, black, purple or carmine colored cells that lie under the
skin, and which either contract or can be spread out under the skin
surface by radially affixed muscles.

One factor here seems to be the subject of the sentence, "a clearly visible
indication." We notice that indication is a relation participant noun, and
that the other argument of the relation #INDICATE (the thing indicated) is
present as a preposition group (of the excitations). Let us call a simple
nominalization (e.g. any gerund) or relation participant noun "saturated" if
all the arguments of the relation are expressed explicitly in the noun group.
The subject of sentence (i) is therefore saturated. Further, if a saturated
noun group contains a postnominal rankshifted qualifier modifying one of the
arguments of the relation, we can call it "supersaturated." The subject in
(i) is also supersaturated, due to the RSQ clause that modifies excitations
("that run through the nervous system"). We would expect a supersaturated
noun group to be a key contributor to sentential complexity, but note that
supersaturation alone is not sufficient grounds for splitting a sentence
(example (iii) above).


A further difficulty with sentence (i) is the appositive that
starts with "those yellow ... carmine colored cells." Here, two
postnominal RSQ clauses modify cells, and in English we would tend to conjoin
the two using and. Furthermore, the second postnominal clause here ("which
either contract or can be spread out ...") is itself conjoined by either / or.
This double level conjunction looks like an important contributor to
complexity, but it is difficult to tell where to draw the line. If our second
clause were "and which either expand or contract," the appositive would
probably not be too long. In (i), however, the two clauses are not parallel.
(The first is active, and the second is passive.) This seems to push the
appositive over the line. Let us say, then, that an appositive can be
considered complex if it goes beyond two levels of parallel conjunction, or
beyond one level of non-parallel conjunction. Once again, a complex
appositive alone should not trigger splitting of a sentence, since example
(ii) above does not seem to be too complex.


By combining a supersaturated noun group with a complex appositive as
defined above, we do get a sentence that is too complex. In dealing with such
a sentence, we would expect a generator to interrupt processing if it detects
an output that satisfies both complexity conditions. In the case of example
(i), nodes would then be popped from the tree back to the beginning of the
appositive, since appositives and conjunctions are natural places to break up
a sentence. What is left on the tree (in this case, everything up to "the
chromatophores of the cephalopod") would become a single sentence. The
appositive part of the semantic representation would be detached and would
become the second sentence, with generation starting from the top. Note that
given the similarity in word order between English major and secondary
clauses, it might be feasible to remodel some of the output that has already
been generated before the interrupt, instead of discarding it altogether. In
some cases, however, this patching might require a certain amount of
ingenuity.
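
The two complexity conditions and their conjunction can be written down as a
short Python sketch. It assumes that the relevant properties have already
been read off the generation tree; the classes NounGroup and Appositive, their
field names, and the function should_split are hypothetical and serve only to
make the rule concrete.

```python
from dataclasses import dataclass


@dataclass
class NounGroup:
    head_is_relational: bool      # nominalization or relation participant noun
    all_arguments_present: bool   # every argument of the relation is expressed
    argument_has_qualifier: bool  # an argument carries a postnominal RSQ clause

    def saturated(self):
        return self.head_is_relational and self.all_arguments_present

    def supersaturated(self):
        return self.saturated() and self.argument_has_qualifier


@dataclass
class Appositive:
    conjunction_levels: int  # depth of conjunction inside the appositive
    parallel: bool           # whether the conjoined clauses are parallel

    def is_complex(self):
        # Beyond two levels if parallel, beyond one level otherwise.
        limit = 2 if self.parallel else 1
        return self.conjunction_levels > limit


def should_split(subject, appositive):
    """Interrupt and split only when both conditions hold together."""
    return subject.supersaturated() and appositive.is_complex()
```

For example (i) the subject is supersaturated and the appositive has two
levels of non-parallel conjunction, so should_split returns True; for example
(ii) the subject is saturated but not supersaturated, so the same appositive
does not trigger a split.
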
Note that not all long sentences can be split as easily as the example
here, and, for some, we might not be able to find a division that maintains
the integrity of the message. Still, I think this example gives an idea of
some of the issues involved in handling sentential complexity. I should also
remark that this emphasis on using syntactic criteria to formulate heuristics
may be misguided. It may be that semantic factors should be considered more
heavily. For translation, however, I would tend to favor heuristics expressed
in terms of the surface structure of the target language, since [illegible].


6.5 Other Necessary Features

Even if it were extended to handle the full range of syntactic
structures, the generator discussed here could not be said to be a complete
one. There are still [illegible], for which additional mechanisms would be
necessary.

6.5.1 Dealing with Ambiguity

Ambiguity is not an issue for the generator in the same way that it is
for the interpreter, but there are several cases worth considering. Two
mechanisms for handling different sorts of ambiguity will be discussed here;
one of these would be desirable in a working translating system, and the other
would be quite important.

The first feature is the ability to translate ambiguity-for-ambiguity,
which was mentioned in the introduction. I think this is a [illegible]
mechanism since I find myself doing it occasionally when hand translating.
For people, such an ability is used most for pronoun reference and implicit
information like understood agents, etc. For a translating system with a weak
deductive component, the ability to translate ambiguity-for-ambiguity might
also be used to avoid choosing between different senses of a word. Especially
in languages as close historically as German and English, it is often possible
[illegible]. In general, a system would have to be quite sophisticated to
translate ambiguity-for-ambiguity, but at the lexical level, we could make a
start with either an English interpretive dictionary or with an associative
ability to link concepts to definitions. (Currently in the system, we can ask
what concept markers are associated with a German word or what English words
are associated with a concept marker, but there is no way to find out easily
what concept markers are associated with English words.) It would be a
relatively simple matter to take two concept markers produced for a German
word, look up a set of English words associated with one of them, and see if
any English words in this set could also have the other concept marker as a
[illegible].
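
The lexical lookup just described can be sketched in Python using only the
two tables the system is said to have: German word to concept markers, and
concept marker to English words. The table contents, the marker names, and
the function ambiguity_preserving_words are all invented for illustration.

```python
# Hypothetical dictionary fragments; the markers and entries are invented.
GERMAN_SENSES = {
    "Nagel": {"#FINGERNAIL", "#METAL-NAIL"},
    "Schloss": {"#CASTLE", "#LOCK"},
    "Haut": {"#SKIN"},
}

ENGLISH_FOR_MARKER = {
    "#FINGERNAIL": {"nail", "fingernail"},
    "#METAL-NAIL": {"nail"},
    "#CASTLE": {"castle", "palace"},
    "#LOCK": {"lock"},
    "#SKIN": {"skin"},
}


def ambiguity_preserving_words(german_word):
    """Return English words that cover every sense of an ambiguous German word.

    An empty set means either that the word is not ambiguous or that no
    single English word carries the same ambiguity.
    """
    markers = sorted(GERMAN_SENSES[german_word])
    if len(markers) < 2:
        return set()
    shared = set(ENGLISH_FOR_MARKER[markers[0]])
    for marker in markers[1:]:
        shared &= ENGLISH_FOR_MARKER[marker]
    return shared
```

With these entries "Nagel" can be rendered as "nail" without choosing a sense,
while "Schloss" has no English word that preserves the ambiguity, so a choice
between "castle" and "lock" would still have to be made.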

To translate ambiguity-for-ambiguity beyond the lexical level, the
generator would need a model of the English interpretive process. The task
there would be to analyze the way the target language is ambiguous by
inspecting the semantic representation or by accessing pre-packaged knowledge.
The pre-packaged knowledge might express such facts as, "In a German
nominalization of a transitive verb, the genitive could be either the subject
or the direct object of the verb, if no other participants are given." The
second phase of this task would then be to find an English structure with a
similar ambiguity, which for this case happens to be a nominalization
with an of. To use a familiar example,

    das Schiessen der Jäger = the shooting of the hunters
    ("the hunters shoot" or "the hunters are shot")

The model of structural ambiguities might be derived automatically from a
parser of the language, although that would not be easy for the particular
parser used here. Wherever it comes from, we would want such
knowledge about ambiguities to be detached from the parser, since we do not
want to have to simulate the parsing process every time we want to determine
whether a structure is ambiguous.
The ability to translate ambiguity-for-ambiguity just discussed would be
attractive in a generator, but not essential. The second feature I will
discuss here, anticipating ambiguities in the output text, is more important,
I think, since it relates to the reliability of a translating system. What I
would like to consider is the situation where the generator produces a
syntactically or semantically ambiguous sentence. The most extreme
step one could take to avoid this problem would be to feed the output text
back into an English interpreter, to see if what was generated contained
serious ambiguities. This would correspond to a translator reading over and
correcting a translation. It would not be an absolute assurance, since the
proof-reader would presumably share the same deductive data base used by the
rest of the system, and knowledge limitations might cause it to miss
ambiguities that are present in the output text.

Even if euch a proof reading facility were s aveilenie - and. sopecially if 
one were not ~ a generator should also be able to anticipate sone. ambiguities 
and avoid them. In his thesis 9), Hii Sasol oan. four, causes of globel. . 
syntactic ambiguity in Eng! leh: 

wD) Choosing & betueen perticiple, and gerund 

(2) Choosing: betneen noun and verb. in Clause first word. position 

@ Choosing the correct _srene) tivity for. the, Yerp: . 


1 cae apt Seetens Yeti ate 
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A4) Choosing an attachment point for a modi ful. phrase . 

It is clear that, given the features saved on the generation tree, checks for
these sorts of ambiguities would be relatively straightforward. A generator
also might be able to anticipate certain semantic confusions about the scope
of a quantifier. ("Quantifier" is used here as an English part of speech;
see Winograd (39, p.67).) Checks for these sorts of ambiguities could be built
into the syntactic specialists of a generator, or the same information could
be embedded in a small routine that would monitor the generation process and
interrupt if one of these ambiguous structures were generated.
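
The monitoring routine suggested above might be sketched as follows. This is
only an illustration in Python, not the implementation described in this
report: the Node class, the feature names ("PARTICIPLE?", "TRANS-AMBIG", and
so on) and the monitor function are all invented for the example, and each
check simply inspects features assumed to have been saved on the generation
tree.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of a hypothetical generation tree."""
    word: str
    features: frozenset = frozenset()
    children: list = field(default_factory=list)


# One check per cause of ambiguity listed above. Every feature name here is
# an assumption made for this sketch; none comes from the implementation.
CHECKS = {
    "participle-or-gerund": lambda n: {"PARTICIPLE?", "GERUND?"} <= n.features,
    "noun-or-verb-first": lambda n: {"CLAUSE-FIRST", "NOUN?", "VERB?"} <= n.features,
    "transitivity": lambda n: "VERB" in n.features and "TRANS-AMBIG" in n.features,
    "attachment": lambda n: "MODIFIER" in n.features and "ATTACH-AMBIG" in n.features,
}


def monitor(tree):
    """Return (check name, word) pairs for every flagged node, in tree order."""
    warnings = []
    stack = [tree]
    while stack:
        node = stack.pop()
        for name, check in CHECKS.items():
            if check(node):
                warnings.append((name, node.word))
        # Push children in reverse so that the leftmost child is visited first.
        stack.extend(reversed(node.children))
    return warnings


tree = Node("can", frozenset({"VERB"}), [
    Node("flying", frozenset({"PARTICIPLE?", "GERUND?"})),
    Node("planes", frozenset({"NOUN"})),
    Node("dangerous", frozenset({"ADJ"})),
])
print(monitor(tree))
```

A generator could call such a routine after each constituent is built and
back up whenever the returned list is not empty.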


6.5.2 When All Else Fails
In the implementation as it stands now, if all the backup possibilities
are exhausted for a particular sentence, the generator simply fails. This
would obviously be undesirable behavior in a working translation system, and
we would want a system to be able to make the best out of a bad situation.
There seem to be two directions one could go to meet this goal, one being
compromise and the other paraphrase. Both features would be desirable in a
system, but it is not at all clear how such behavior could be produced. The
two will be considered briefly here, but no solutions will be offered, since
they depend, I think, on extensive further research.

In the current version of the generator, backup is handled by trying
alternative choices, but this is always within the context of a set of fixed
choices; no attempt is made to suspend rules. In actual situations, it is
entirely possible that no combination of permissible choices adequately
translates the original, and, in this case, we would want to produce the best
approximation possible. This could be done either by leaving out some of the
content of the original, or by violating one or more of the rules of the
target language. (The other alternative is paraphrase, which is discussed
below.) In general, when translating scientific texts we will prefer to
violate rules rather than to leave out content. The choice of which rules to
suspend, however, can be a difficult one, and in arriving at this decision, we
might get involved in delicate trade-offs between different possibilities.
The generator would probably need the ability to produce a set of alternative
trees for which different rules had been suspended and then use some set of
criteria to determine which represented the best compromise. The ability to
make compromises would be important in a working system, since it often
seems that the essence of translation is the ability to find good compromises.
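
One way to picture the selection among alternatives with suspended rules is
the sketch below. It is a hypothetical illustration in Python, not part of
the implementation: the rule names, the weights, the word limit and the
candidate records are all assumptions. The heavier weight on omitted content
reflects the preference stated above for violating rules rather than leaving
out content.

```python
def best_compromise(candidates, rules, weights):
    """Return (candidate, suspended rule names) with the smallest penalty."""
    if not candidates:
        raise ValueError("no candidates to choose from")
    scored = []
    for cand in candidates:
        # A rule counts as suspended for a candidate that does not satisfy it.
        suspended = [name for name, holds in rules.items() if not holds(cand)]
        penalty = sum(weights[name] for name in suspended)
        scored.append((penalty, cand, suspended))
    _, cand, suspended = min(scored, key=lambda entry: entry[0])
    return cand, suspended


# Hypothetical rules: keep all content, and stay under an assumed word limit.
rules = {
    "keep-all-content": lambda c: not c["omitted"],
    "sentence-length": lambda c: len(c["text"].split()) <= 12,
}
weights = {"keep-all-content": 10, "sentence-length": 1}

candidates = [
    {"text": "The play of the chromatophores is a sign.",
     "omitted": ["clearly visible"]},
    {"text": "The play of the chromatophores is a clearly visible sign of "
             "the excitations running in the nervous system.",
     "omitted": []},
]

chosen, suspended = best_compromise(candidates, rules, weights)
print(chosen["text"])
print(suspended)
```

The difficult part, as noted above, is choosing the criteria; a fixed table
of weights is only the simplest possible stand-in for them.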


If no satisfactory compromise can be found, the next step is to try
paraphrase. I am using "paraphrase" in a special sense here, to mean a change
in the explicit meaning, although not in the total meaning, of a text. Let me
give a simple example of a situation in which explicit meaning differs but in
which total meaning is equivalent. The English phrase "a clear day" is
expressed in German as "ein heller Tag" (literally, "a bright day").
Obviously, if the sky is clear of clouds, then the sun can shine, and if
the sun is bright, we expect the sky to be clear. The two languages pick up
on different ends of this if-and-only-if relationship. Although the explicit
statements differ, the implications are still roughly equivalent.


This particular example would probably best be handled in the
definitions, by translating hell as clear under certain circumstances; we
probably would not use a general paraphrase mechanism. Not all possible
paraphrases, of course, can be anticipated in this way. If the generator
cannot translate a phrase, we will want it to consider the implications to see
if another equivalent phrase can be found. This will involve a return to the
deductive component, since only a fraction of the implicit meaning of a
sentence is carried in the semantic representation. The deductive component
would presumably find an equivalent paraphrase, alter the semantic
representation, and send this back to the generator for another try.
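
The simpler of the two cases, handling "hell" inside a definition rather than
through a general paraphrase mechanism, could look roughly like this. The
sketch is in Python and is purely illustrative: the marker names, the small
noun table and the function translate_hell are assumptions, not entries from
the actual dictionary.

```python
# Hypothetical semantic markers for a few German nouns (assumed for the sketch).
NOUN_MARKERS = {
    "Tag": "#TIME-PERIOD",
    "Himmel": "#SKY",
    "Licht": "#LIGHT",
}


def translate_hell(noun):
    """Choose an English adjective for 'hell' from the noun it modifies."""
    marker = NOUN_MARKERS.get(noun)
    if marker in ("#TIME-PERIOD", "#SKY"):
        return "clear"
    # Default rendering when no special circumstance applies.
    return "bright"


print(translate_hell("Tag"))
print(translate_hell("Licht"))
```

Anything that cannot be listed in a table of this kind is what would have to
go back to the deductive component.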


Although the generator currently implemented is very limited in scope, I
have tried to foresee the kinds of extensions that would be necessary. The
important features of the generator are that it uses the semantic
representation as input and that it is not constrained to generate components
in the linear order that they will appear in a sentence. With the extensions
made to PROGRAMMAR, we can maintain a generation tree, and it is possible to
by-pass syntactic specialists or definition routines, to specify a definition
to be used, or to explicitly rule out particular word choices. Emphasis
has been placed on some problems that are traditionally considered stylistic,
but which are of considerable importance for translation. A great deal more
analysis needs to be done on the semantic motivation for particular syntactic
choices, and the generator would also benefit from investigations of problems
such as repetitiveness and sentence length. Finally, the issues discussed in
section 6.5 - avoiding ambiguity, suspending rules, and paraphrase - are
problems that are wide open for further research.
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Chapter 7 -- Conclusions 

In the course of describing the implementation, I have discussed some
problems and some solutions. Not all of the problems have been satisfactorily
solved, and among these are some, I think, that are interesting enough to
justify more intensive research. In this final chapter, let me review what I
consider to be the major problems encountered in this project and make some
remarks about the different solutions proposed.

Heading the list of problems in mechanical translation is still, of
course, the problem of understanding. This was outside the scope of the
project, but I want to emphasize again here that true understanding of the
source text is crucial to trustworthy translation. Related to this is the
issue of accountability. A user should always be able to ask a system what
choices were made and why. Just as a human translator could give reasons for
a particular disambiguation or word choice, I think it is essential that a
system be able to do the same. This will not guarantee reliability, but it
does give the user some control by giving him a chance to catch gaps in the
knowledge base, incorrect assumptions, etc.
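
A minimal sketch of such an accountability record is given below, in Python.
It is an assumption-laden illustration rather than anything the implementation
contains: the Decision and DecisionLog classes and their field names are
invented here, and a real system would need to record far more context.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Decision:
    """One recorded choice (hypothetical record layout)."""
    kind: str        # e.g. "word choice" or "disambiguation"
    item: str        # the source word or structure the choice concerns
    chosen: str
    rejected: tuple
    reason: str


class DecisionLog:
    def __init__(self):
        self._entries = []

    def record(self, kind, item, chosen, rejected, reason):
        self._entries.append(Decision(kind, item, chosen, tuple(rejected), reason))

    def why(self, item):
        """Answer the user's question: what was chosen for this item, and why?"""
        return [
            f"{d.kind}: chose '{d.chosen}' over {list(d.rejected)} because {d.reason}"
            for d in self._entries
            if d.item == item
        ]


log = DecisionLog()
log.record("word choice", "hell", "clear", ["bright"], "it modifies 'Tag'")
print(log.why("hell"))
```

The point of keeping the rejected alternatives is that the user can see not
only the choice but also what the system believed the alternatives were.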

The first problem encountered in the implementation was that of parsing
German text. Here, there were two difficult areas - German inflection and the
relatively wide (compared to English) syntactic variety, e.g. prenominal
clauses, the relative freedom of word order for verb objects, end-order
constructions, etc. The former involved changes to PROGRAMMAR to handle
multiple feature lists. These changes were extensive, although of a routine
nature. If PROGRAMMAR had originally been written to handle Russian or
Icelandic, the morphology of German would not have come as such a shock.
English is biased almost exclusively toward word order in the linguistic
trade-off between morphology and word order, so one would expect to need
fairly extensive changes in order to handle a more heavily inflected language.

With respect to actual parsing performance, I would say that the general
approach used by Winograd is as good at handling German as it is at handling
English. That is to say, the performance of the parser leaves no doubt that
it could someday be extended and embedded in a practical system. There remain
certain trouble spots, however, that are present in English but are
exacerbated in German. The implementation uses a number of ploys to deal with
the more varied syntactic choices of German, but it is not clear that the
solution is general enough. The recent dissatisfaction expressed about backup
in language-parsing seems to be well-founded, and it will be interesting to
see the results in this area.

A key question in mechanical translation is what the input to the
generator should look like. In designing the implementation, I started with
two assumptions about this issue. The first was that for a given target
language we can predict the sorts of information that will be necessary for
generation and the sorts that will not. Second, I assumed that surface
generation would be enough, that the generator could follow the general
organization of the source text sentence for sentence. These are based, in
turn, on the underlying hypothesis that translation of scientific prose does
not need the full power of a general purpose generator. Adopting these
assumptions resulted in a commitment to the use of a semantic representation
as input to the generator. In chapter 6, we saw that these assumptions do not
always hold. Some situations require paraphrase, and in others we might have
to restructure the message entirely. I think they are true often enough,
however, to justify substantial differences between the form of generators for
translation and general purpose generators. I could be wrong in this,
however, and only increased research will tell whether efficiency lies in the
direction of a special purpose or of a general generating process.

There are many aspects of the two languages considered here that I would
like to have investigated in more detail. All of these, I think, are
interesting research areas in their own right, irrespective of the
implementation involved. Only a token gesture was made toward using
information about collocations for parsing and generation. It would be
interesting to see how this information could be used to build up lexical
expectations about the rest of the sentence, in addition to the syntactic
expectations currently embedded in the parser. Furthermore, the area of
generation poses a number of interesting questions, many of which have been
given only rudimentary answers here. Issues of word choice and sentence
length deserve more attention. A good deal more analysis needs to be done on
questions of semantic motivation for surface structure choices. Finally,
another very interesting problem is that of suspending rules to make good
generating compromises.

Throughout this project, I have been continually impressed by both the
economy of natural language as a communication medium and the variety of its
mechanisms. I find this convincing evidence that any translating system that
throws away information, be it syntactic, lexical, or semantic, cannot hope
for success. In the end, only a total approach to language will offer even an
initial solution to the translation problem, and a lot of intriguing questions
still remain unanswered.
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APPENDIX A. WORD FEATURES 


WORD DEFINITIONS FOR THE INTERPRETIVE GRAMMAR REQUIRE THE FOLLOWING SYNTACTIC 
INFORMATION. 


STARRED FEATURES INDICATE REQUIRED MATCHES IN INFLECTION. FEATURES WITH DOTS 
INDICATE TYPES OF FEATURES: 

PERSON= P1ST P2ND-FAM P2ND-POL P3RD

GENDER= MASC FEM NEUT

CASE= NOM GEN DAT ACC 

NUMBER= SING PLUR 


ADJECTIVE, EITHER ONE THAT CAN BE DECLINED OR NOT, 

COMP = FORMS COMPARATIVE

SUP = FORMS SUPERLATIVE

IF A SUPERLATIVE OR COMPARATIVE IS ACTUALLY FOUND THEN THE FEATURES SUPERL OR
COMPAR ARE ADDED.


ATTRIBUTIVE ADJECTIVES: 
(ADJ ATTR DECL COMP SUP)


COMPLEMENTS: 
(ADJ NODECL COMP SUP .CASE. )
                     NOOBJ


ADJECTIVES THAT MODIFY VERBS OR OTHER ADJECTIVES:
(ADJ RELMOD NODECL COMP SUP)
THOSE RELMOD ADJECTIVES THAT MAY NOT APPEAR IN THE FIRST POSITION IN THE 
SENTENCE: 
(ADJ RELMOD NON-FRONTAL) 


POSTNOMINAL ADJECTIVES: 
(ADJ POSTNOM NODECL) 


WAS FÜR:
(ADJ INTER NODECL .CASE.) 


WO, WARUM, WOHIN, ETC.: 
(ADJ INTER) 


WORÜBER, WORAUS, WOZU, ETC.:
(ADJ INTER WO-FORM) 


ADVERB MODIFYING ADJECTIVES AND OTHER ADVERBS:
(ADV) 


BINDERS: 
(BINDER) 


COORDINATING CONJUNCTIONS: 
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(CONJ) 


DER DIE DAS ETC.: 
(DET DEF .GENDER. .CASE. .NUMBER.)


DIESE JENE ETC.:
(ENDINGS DETERMINE GENDER, CASE & NUMBER)
(DET DEMADJ)


EIN KEIN EINIGE:
(ENDINGS DETERMINE GENDER, CASE, & NUMBER)
(DET INDEF SING )
           PLUR


WELCHE : 
(DET INTER DECL) 


WESSEN: 
(DET INTER NODECL P.GEN) 


MEIN DEIN ETC.: 
(ENDINGS DETERMINE GENDER, CASE & NUMBER. SEE POSS-SUBST FOR P.)
(DET POSS P.PERSON. P.GENDER. P.NUMBER.)


JA, NEIN, DANKE, AHA: 
(INTERJECTION)


STRONG NOUNS, THAT IS THOSE THAT HAVE A REGULAR DECLENSION:
(NOUN STRONG .GENDER. ... NOPLUR )
                          PLUR
IF THE NOUN MAY TAKE MORE THAN ONE GENITIVE OR PLURAL ENDING, THESE ARE LISTED
SIDE BY SIDE IN THE FEATURE LIST, RATHER THAN IN SEPARATE ENTRIES.


WEAK NOUNS, LIKE SOLDAT, MENSCH, ETC.:
(NOUN WEAK .GENDER.)


MIXED NOUNS:
(NOUN MIXED .GENDER. ) 


NOTE: ALL NOUNS ALSO MAY BE EITHER COUNT, MASS OR PROPN (PROPER NOUN). SHOULD
A PARTICULAR NOUN BELONG TO MORE THAN ONE OF THESE CATEGORIES THE FEATURES
WILL BE LISTED TOGETHER IN THE SAME WAY THAT THE TRANSITIVITY PROPERTIES ARE
FOR THE VERB.

WEAK AND MIXED NOUNS MAY INCLUDE GENITIVE ENDINGS IN IRREGULAR CASES.


CARDINAL NUMBER:
(NUM)


DIE MEINIGE, DIE MEINE, ETC.:
-—_INEORMATION PREE 
UNPREFIXED 


EVALUATED. 
(POSS-SUBST P.GENDER. P.PERSON. P.NUMBER.)


PREPOSITION:
(PREP .CASE. PRE )
             POST


MAN JEMAND ETC.:
(ENDINGS SUPPLY CASE & NUMBER)
(PRON ABSTRACT)


(PRON INTER .CASE.)

MEIN- ETC.:

(SEE POSS-SUBST FOR EXPLANATION OF P. ENDINGS DETERMINE GENDER, CASE, &
NUMBER.)

(PRON POSS P.GENDER. P.PERSON. P.NUMBER.)


DA-COMPOUNDS: 
(PRON PREP RELMOD) 


ER SIE ES ETC.: 
CASE MAY BE EITHER (NOM GEN DAT ACC REFL WENF) 
(PRON PERS DEF .PERSON. .GENDER. .CASE. .NUMBER.)


DAS, WAS, DA:
(PRON PERS RELMOD)


EIN, KEIN: 
(ENDINGS SUPPLY CASE & NUMBER) 
(PRON PERS INDEF)


DER, DEREN, WELCHE, ETC.:
(PRON REL .CASE. .GENDER. .NUMBER.)


SELBST, ALL, ETC.:
(QUANT MASS )
       COUNT


PARTICLE USED AS SEPARABLE PREFIX: 
(SEPPR) 


(VERB REG   UML SEIN  .TRANSITIVITY. -SEPPR  .SEPPR.)
      IRR   --- HABEN                +SEPPR
      MIXED                          INSEPPR

IF +SEPPR, THEN THE SEPARABLE PREFIX IS GIVEN. FOR SEPARABLE PREFIXES, BOTH
PARTS OF THE WORD MUST BE ENTERED SEPARATELY IN THE DICTIONARY, ALTHOUGH


TRANSITIVITIES ARE: 
A+D ReA UeA AdG AOA Re RAG AP MOE ROE ZAORGNWPE! 


I = INTRANSITIVE
A = ACCUSATIVE
D = DATIVE
G = GENITIVE
N = NOMINATIVE
R = REFLEXIVE
W = (FOR WEMFALL) DATIVE REFLEXIVE
P = PREPOSITION AS OBJECT
Z = RANKSHIFTED NOUN GROUP
E = ANY ADVERBIAL

SOME VERBS HAVE OBLIGATORY LOCATION, TIME, ETC. JUST WHICH CASE APPLIES IS
SPECIFIED ROUGHLY IN SEMANTIC RESTRICTIONS.

IF A VERB HAS MORE THAN ONE TRANSITIVITY RELATION, THESE ARE INCLUDED IN
PARENTHESES IN ONE DEFINITION RATHER THAN MAKING SEPARATE LISTS FOR EACH ONE.
THESE LISTS ARE THEN EXPANDED AUTOMATICALLY WHEN ENCOUNTERED.
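
The automatic expansion of a definition holding several transitivities can be
pictured with the short Python sketch below. It is only an illustration under
assumptions: the function name expand, the use of a tuple to stand for the
parenthesized alternatives, and the sample verb entry are all hypothetical and
are not taken from the actual dictionary.

```python
from itertools import product


def expand(definition):
    """Expand parenthesized alternatives into one feature list per alternative."""
    # A tuple stands for a parenthesized group; any other item is a fixed feature.
    slots = [item if isinstance(item, tuple) else (item,) for item in definition]
    return [list(combination) for combination in product(*slots)]


# Hypothetical entry for a verb that is either accusative or accusative+dative.
entry = ["VERB", "IRR", "HABEN", ("A", "A+D"), "-SEPPR"]
for feature_list in expand(entry):
    print(feature_list)
```

Keeping the alternatives together in one entry and expanding them only when
the word is encountered keeps the dictionary small at the cost of a little
work at look-up time.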
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APPENDIX B. SAMPLE PARSE... 


EIN DEUTLICH SICHTBARES ZEICHEN FÜR DIE IM NERVENSYSTEM VERLAUFENDEN
ERREGUNGEN IST DAS SPIEL DER CHROMATOPHOREN


(((EIN DEUTLICH SICHTBARES ZEICHEN FÜR DIE IM NERVENSYSTEM VERLAUFENDEN
ERREGUNGEN IST DAS SPIEL DER CHROMATOPHOREN)
(CLAUSE MAJOR TOPLEVEL DECLARATIVE PEAR ORO 
((EIN DEUTLICH SICHTBARES ZEICHEN FÜR DIE IM NERVENSYSTEM VERLAUFENDEN
ERREGUNGEN)
(NG NOM FULL NOUN DET INDEF NEUT SING P3RD COUNT)

(EIN (DET INDEF SING NEUT NOM))

((DEUTLICH SICHTBARES) (ADJG ATTR NEUT SING MIXED NOM)
(DEUTLICH (ADJ RELMOD NODECL COMP SUP))
(SICHTBARES (ADJ ATTR DECL MIXED NEUT NOM SING COMP SUP)))
(ZEICHEN (NOUN STRONG NEUT NOM SING P3RD GEN-S PLURS COUNT))


((FÜR DIE IM NERVENSYSTEM VERLAUFENDEN ERREGUNGEN)
(PREPG SIMPLE)

(FÜR (PREP ACC PRE))

((DIE IM NERVENSYSTEM VERLAUFENDEN ERREGUNGEN)
(NG ACC SIMPLE DET DEF NOUN PLUR MASC P3RD COUNT)

(DIE (DET DEF ACC FEM PLUR))

((IM NERVENSYSTEM VERLAUFENDEN)
(CLAUSE RSQ PRESP PRENOM NONEX SUBORDINATE FEM PLUR ACC WEAK)

((IM NERVENSYSTEM)
(PREPG NO-RSQ ADVERBIAL)

(IM (PREP MIXED PRE))

((IM NERVENSYSTEM)
(NG DAT NO-RSQ DET DEF NOUN NEUT SING P3RD COUNT)

(IM (DET DEF NEUT DAT SING))

(NERVENSYSTEM
(NOUN STRONG NEUT DAT SING P3RD GEN-S PLUR-E COUNT))))

(VERLAUFENDEN
(PART PRESP DECL WEAK FEM ACC PLUR ATTR IRR UML SEIN NO-GE
MVB PLAIN P AS-VERBH))
(IST (VERB IRR MVB PLAIN SEIN NO-END PRES INDIC P3RD SING))


((DAS SPIEL DER CHROMATOPHOREN)
(NG NOM FULL NOUN DET DEF NEUT SING P3RD COUNT)

(DAS (DET DEF NOM NEUT SING))
(SPIEL (NOUN STRONG NEUT NOM SING P3RD GEN-S GEN-ES PLUR-E COUNT))

((DER CHROMATOPHOREN)
(NG GEN ...)

(DER (DET DEF GEN MASC PLUR))
(CHROMATOPHOREN (NOUN PLUR-EN PLUR P3RD GEN GEN-S MASC COUNT))))))


APPENDIX C. A SECTION OF THE CONCEPT MARKER TREE 


EACH CONCEPT MARKER IS LINKED TO ITS PARENT BY THE UP PROPERTY AND TO ITS
DAUGHTERS, IF ANY, BY DOWN. THE ORDERING OF RESTRICTION LISTS CORRESPONDS TO 
THE ORDER OF THE SEMANTIC ARGUMENTS OF THE RELATION. 


(DEFS #MENTAL-PROCESS UP #RELATION
DOWN (#PERCEPTION #REACTION #COGNITION))


(DEFS #PERCEPTION UP #MENTAL-PROCESS
DOWN (#SENSORY-INVOLUNTARY #SENSORY-VOLUNTARY))


(DEFS #SENSORY-INVOLUNTARY UP #PERCEPTION
DOWN (#DISTINGUISH #PERCEIVE))


(DEFS #DISTINGUISH UP #SENSORY-INVOLUNTARY
RESTRICTIONS: (#LIVING-THING #CONCRETE #CONCRETE))


(DEFS #PERCEIVE UP #SENSORY-INVOLUNTARY
DOWN (#SEE))


(DEFS #SEE UP #PERCEIVE
RESTRICTIONS: (#ANIMAL #CONCRETE))


(DEFS #SENSORY-VOLUNTARY UP #PERCEPTION
DOWN (#OBSERVE))


(DEFS #OBSERVE UP #SENSORY-VOLUNTARY
RESTRICTIONS: (#HUMAN #OBJECT))


(DEFS #REACTION UP #MENTAL-PROCESS
DOWN (#WISH-FOR))


(DEFS #WISH-FOR UP #REACTION
RESTRICTIONS: (#HUMAN (EITHER: #RELATION #OBJECT)))


(DEFS #COGNITION UP #MENTAL-PROCESS
DOWN (#NEUTRAL-COGNITION #VALUE-ASSIGNED))


(DEFS #NEUTRAL-COGNITION UP #COGNITION
DOWN (#KNOW))


(DEFS #KNOW UP #NEUTRAL-COGNITION
RESTRICTIONS: (#HUMAN #FACT))


(DEFS #VALUE-ASSIGNED UP #COGNITION
DOWN (#ASSUME))
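

The way UP links and restriction lists could be used together is sketched
below in Python. This is an illustration only, not the code of the
implementation. The links for the perception markers follow the listing in
this appendix, with marker names written with the # prefix; the links for
#HUMAN, #ANIMAL and #LIVING-THING belong to a part of the tree not shown here
and are assumed for the example, as are the function names down, is_a and
satisfies.

```python
# Child -> parent links (the UP property).
UP = {
    "#MENTAL-PROCESS": "#RELATION",
    "#PERCEPTION": "#MENTAL-PROCESS",
    "#SENSORY-INVOLUNTARY": "#PERCEPTION",
    "#SENSORY-VOLUNTARY": "#PERCEPTION",
    "#DISTINGUISH": "#SENSORY-INVOLUNTARY",
    "#PERCEIVE": "#SENSORY-INVOLUNTARY",
    "#SEE": "#PERCEIVE",
    "#OBSERVE": "#SENSORY-VOLUNTARY",
    # Assumed links, NOT part of the section of the tree shown above:
    "#HUMAN": "#ANIMAL",
    "#ANIMAL": "#LIVING-THING",
    "#LIVING-THING": "#CONCRETE",
}

# Restriction lists, ordered like the semantic arguments of the relation.
RESTRICTIONS = {
    "#DISTINGUISH": ("#LIVING-THING", "#CONCRETE", "#CONCRETE"),
    "#SEE": ("#ANIMAL", "#CONCRETE"),
}


def down(marker):
    """Daughters of a marker, derived by inverting the UP links."""
    return [child for child, parent in UP.items() if parent == marker]


def is_a(marker, ancestor):
    """True if ancestor is the marker itself or lies above it in the tree."""
    while marker is not None:
        if marker == ancestor:
            return True
        marker = UP.get(marker)
    return False


def satisfies(relation, args):
    """Check each argument marker against the restriction in the same position."""
    wanted = RESTRICTIONS[relation]
    return len(args) == len(wanted) and all(
        is_a(arg, restriction) for arg, restriction in zip(args, wanted)
    )


print(down("#PERCEPTION"))
print(satisfies("#SEE", ("#HUMAN", "#ANIMAL")))
print(satisfies("#SEE", ("#CONCRETE", "#HUMAN")))
```

Storing only the UP links and deriving DOWN from them is a choice made for
the sketch; the listing above stores both properties explicitly.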
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APPENDIX 0. SAMPLE SEMANTIC REPRESENTATION 
A sample representation of the sentence, "Ein deutlich sichtbares Zeichen für
die im Nervensystem verlaufenden Erregungen ist das Spiel der Chromatophoren
der Cephalopoden." Semantic structures produced for tense and those produced
by determiners have been omitted for clarity.


[Tree diagram; branch lines not recoverable. Node labels in reading order:]

RSS
#EQUATE
HEAD   HEAD
META
#SOMETHING
RSS
#ABLE
HEAD   OSS
RSS
#SEE
OSS
#PERSON
RSS              OSS
#HAVE-PROPERTY   #CEPHALOPOD
PSS
#OPTIMUM-COGNITIVE
RSS
#INDICATE
OSS
#EXCITATION
RSS
#CONTAINMENT
OSS
#NERVOUS-SYSTEM
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